The research leading to the present invention was supported, at least in part, by a grant
from the National Institutes of Health, Grant Numbers GM60346 and GM61393.
Accordingly, the Government may have certain rights in the invention.

FIELD OF THE INVENTION

The present invention relates generally to the field of drug metabolism and drug 
efficacy and toxicity, particularly with respect to cancer therapeutics and
immunosuppressants. In particular, the present invention relates to genetic 
polymorphisms responsible for altered expression of the cytochrome P450 CYP3A5 
enzyme. 

BACKGROUND AND SIGNIFICANCE

There is significant interpatient variation in drug metabolism. This is
clinically important for drugs with narrow therapeutic indices such as the
immunosuppressive agent cyclosporin A and anti-cancer agents including etoposide,
teniposide, taxol and ifosphamide (Weber, G.F. and Waxman, D.J. (1993) Biochem
Pharm 45: 1685-1694; Relling, M.V. et al. (1994) Mol. Pharmacol. 45: 352-358).
The clinical significance of wide variations in drug metabolism is realized as major
effects on drug efficacy, drug toxicity and hence, therapeutic outcome. For these
reasons, it is important to elucidate the factors that regulate variability in drug
metabolism.

Genetic Polymorphisms in Drug-Metabolizing Enzymes



The number of man-made and natural environmental substances increases at an
exponential rate, and many of these substances pose a health risk to exposed
individuals. To protect the human population from the adverse effects of these
agents, it is desirable to be able to identify the individuals within the population who
are at a higher risk of developing the adverse effects. Molecular biology and
molecular genetics have become essential tools in environmental toxicology. In fact,
it is now feasible to approach experimentally one of the most challenging questions
facing toxicologists today, namely, the identification of genes that contribute to an
increased resistance (or sensitivity) to toxic environmental agents.

Susceptibility to chemical exposure, whether therapeutic, environmental, or
occupational, is a well-documented phenomenon with a myriad of underlying causes,
including age, health, gender, nutritional status, concomitant therapy, and genetic
factors. It is now well established that a major determinant of host-specific chemical
susceptibility is the genetic variability observed in more than a dozen superfamilies of
enzymes, collectively termed xenobiotic or drug-metabolizing enzymes. In addition
to metabolizing the vast majority of chemicals to which humans are exposed on a
daily basis, the members of these complex enzyme families also participate in many
critical endogenous processes. Many drug-metabolizing enzymes have dual, often
opposite, functions. For example, enzymes that detoxify lipid-soluble chemicals, by
converting them to more readily excreted water-soluble metabolites, may also be
capable of activating inert chemicals to highly reactive intermediates that interact
with cellular macromolecules such as protein and DNA. Thus, for each chemical to
which humans are exposed, there exist two potential competing pathways, one of
metabolic activation and another of detoxification. Often these two pathways are
exerted on the same compound, which becomes metabolically activated in order to be
detoxified.

Subtle and gross differences in the genes encoding these enzymes have been
identified and shown to result in marked variations in enzyme activity. These
differences have been referred to as pharmacogenetic or, more broadly, ecogenetic
polymorphisms (Motulsky, A.G. (1991) Pharmacogenetics 5:59). It has become
apparent that each individual possesses a distinct complement of drug-metabolizing
enzyme activities and that it is the complex interplay of different enzyme variants that
ultimately determines not only the fate of a chemical in any given individual, but also
its potential toxicity. The development of relatively simple DNA-based tests,
designed to identify specific gene alterations in these enzymes, can provide more
accurate predictions of individual response to chemical exposure, thus expanding the
field of preventive toxicology.

The vast majority of studies examining the relationship between genetic
polymorphisms in drug-metabolizing enzymes and human health relate to the
cytochrome P450 superfamily, the predominant phase I drug-metabolizing enzymes
(Nebert, D.W. and Gonzalez, F.J. (1987) Ann Rev Biochem 56:945; Nebert, D.W.
and McKinnon, R.A. (1994) Prog. Liver Dis. 12:63). One or more members of the
cytochrome P450 superfamily are involved in the metabolism of almost all chemicals
to which humans are exposed. Found in animals, plants, fungi, yeast, and bacteria,
these highly versatile enzymes also catalyze the oxidative metabolism of many
critical endogenous substrates. The advent of cDNA cloning technology has resulted
in rapid insights into the multiplicity of cytochrome P450 enzymes, with more than
500 distinct cytochrome P450 genes identified in all species (Nelson, D.R. et al.,
(1993) DNA Cell Biol 12:1; Nelson, D.R. et al., (1996) Pharmacogenetics 6:1). It has
been estimated that humans may possess as many as 60 distinct P450 genes.



The genes in families CYP1, CYP2, and CYP3 of cytochrome P450 code for enzymes
that are primarily responsible for the metabolism of most procarcinogens,
promutagens, and drugs. The dual role of these enzymes in both detoxification and
metabolic activation has prompted a vigilant survey of CYP genes for polymorphisms
likely to result in variable enzymatic activities.



Cytochrome P450 (CYP) 3A Enzymes and Their Substrates

The cytochrome P450 proteins (CYPs) are a family of haem proteins resulting from
expression of a gene super-family that currently contains over 500 members in
species ranging from bacteria through to plants and animals. In humans, about 40
different CYPs are present and these play critical roles by catalyzing reactions in: (a)
the metabolism of drugs, environmental pollutants and other xenobiotics; (b) the
biosynthesis of steroid hormones; (c) the oxidation of unsaturated fatty acids to
intercellular messengers; and (d) the stereo- and regio-specific metabolism of
fat-soluble vitamins.

Quantitatively, the CYP3A family is the most abundantly expressed of the human
CYP450s, representing on average 30% and 70% of total CYP450 in liver and
intestine respectively (Shimada, T. et al. (1994) J. Pharmacol. Exp Therap 270:
414-423). Four unique members of this gene family have been described. Sequences
(cDNA) for CYP3A3 and CYP3A5 were cloned in the laboratory of P.S. Guzelian
(Schuetz, J.D. et al. (1989) Arch Biochem Biophys 274: 355-365; Molowa D.T. et al.
(1986) Proc. Natl. Acad Sci USA 83: 5311-5315). cDNAs for CYP3A7 and
CYP3A4, the major CYP3A that is expressed in all humans, have also been isolated
(Beaune, P.H. et al. (1986) Proc. Natl Acad Sci USA 83: 8064-8068; Kitada, M. et al.
(1987) J. Biol Chem 262: 13534-13537). One CYP3A5 pseudogene, CYP3A5P, has
been reported (Schuetz, J.D. et al. (1995) Biochim. Biophys. Acta 1261: 161-165). A
large number of drugs are mainly metabolised by the CYP3A subfamily (see TABLE
1). Therefore, maturational changes in CYP3A ontogeny may impact on the clinical
pharmacokinetics of these drugs.

TABLE 1. Important substrates for cytochrome P450 (CYP) 3A

DRUGS

Anaesthesia-analgesics: Alfentanil; Fentanyl; Lidocaine (lignocaine); Ethylmorphine
Antihistamines: Astemizole; Mizolastine; Terfenadine
Antireflux: Cisapride
Anti-emetic: Ondansetron
Anticonvulsants: Carbamazepine; Clonazepam; Ethosuximide; Zonisamide
Anti-HIV: Indinavir; Ritonavir; Saquinavir
Antimicrobials: Clindamycin; Erythromycin; Rifampicin (rifampin)
Antifungals: Ketoconazole; Miconazole
Immunosuppressants: Cyclosporin (M1 formation); Cyclosporin (M17 formation); Tacrolimus (FK-506)
Chemotherapeutics: Busulfan; Doxorubicin; Etoposide; Tamoxifen (also CYP2D6); Vinblastine; Vincristine
Benzodiazepines: Alprazolam; Diazepam (minor); Midazolam (1-hydroxy formation); Midazolam (4-hydroxy formation); Temazepam; Triazolam
Antihypertensives: Amlodipine; Felodipine; Isradipine; Nicardipine; Nifedipine
Anti-arrhythmics: Verapamil; Quinidine
Antidepressants: Imipramine; Nefazodone; Sertraline
Miscellaneous: Dextromethorphan; Benzphetamine

XENOBIOTICS

Aflatoxin B1; Benzopyrene activation; Heterocyclic amines; Sterigmatocystin

ENDOGENOUS SUBSTRATES

Androstanedione (6β-hydroxylation); Cortisol (6β-hydroxylation); Dehydroepiandrosterone;
Dehydroepiandrosterone sulfate; Estradiol; 17α-Ethinylestradiol; Progesterone (6β-hydroxylation);
Testosterone (15β-hydroxylation); Testosterone (2β-hydroxylation); Testosterone (6β-hydroxylation)

References for Table 1: Michalets, E.L. (1998) Pharmacotherapy 18(1): 84-112.
Kearns, G.L. (1995) Curr Opin Pediatr 7:220-233

CYP3A4 is the most abundantly expressed CYP and accounts for approximately 30 to
40% of the total CYP content in human adult liver and 70% in the small intestine.
CYP3A5 is 83% homologous to CYP3A4, is expressed at a lower level than CYP3A4
in the liver in some individuals - but may be the dominant CYP3A in up to 10% of
Caucasians (K. Thummel, personal communication) - and is the main CYP3A
isoform in the kidney and many extrahepatic tissues. CYP3A7 is the major CYP
isoform detected in human embryonic, fetal and newborn liver, but is also detected in
adult liver, although at a much lower level than CYP3A4.

Cytochrome P450 3A5

The substrate specificity of CYP3A5 appears to be similar to that of CYP3A4;
however, some differences in catalytic properties have been found. In a reconstituted
system, the formation rate of 1-hydroxy-midazolam is considerably higher with
CYP3A5 than with CYP3A4. In contrast, the rate of formation of
4-hydroxy-midazolam with CYP3A4 and CYP3A5 is similar (Gorski, J.C. et al., (1994)
Biochem Pharmacol 47(9): 1643-53). No CYP3A5 catalytic activity was found
towards quinidine, 17α-ethinyl-estradiol and aflatoxins, all substrates of CYP3A4
(Wrighton, S.A. et al., (1990) Mol Pharmacol 38(2): 207-213). However, Gillam et
al. did find considerable catalytic activity of CYP3A5 towards both erythromycin
(about 6 times higher when compared with CYP3A4) and ethylmorphine (Gillam
E.M. et al., (1995) Arch Biochem Biophys 317(2): 374-84). Interestingly, Gorski et
al. found a much better correlation between midazolam hydroxylation and
erythromycin N-demethylation when livers containing both CYP3A4 and CYP3A5
were excluded from analysis, a finding which supports the different isoform
specificity for these drugs (Gorski, J.C. et al., (1994) Biochem Pharmacol 47(9):
1643-53).



A tabulation of the results of studies examining the substrate specificity of CYP3A
isoforms 3A4, 3A5 and 3A7 is presented in TABLE 2.

In summary, the specificity of CYP3A4 and 3A5 for the biotransformation of many
substrates appears to be similar, although the extent and rate of metabolic conversion
by the individual isoforms may be quite different for a given substrate. The
discrepant results of in vitro studies probably reflect the sensitivity of CYP3A
metabolic activities to incubation conditions.

Polymorphic Expression of CYP3A5

To a large extent, variation in drug metabolism is due to inter-individual differences
in expression of CYP3A. It has been estimated that the metabolism of up to one-half
of all commercially available drugs is catalyzed by a single dominant cytochrome
P450 gene subfamily, the cytochromes P450 3A (Cholerton, S. et al. (1992) Trends
Pharmacol Sci. 13: 434-439). Thus, administration of a therapeutic dose of
cyclosporin A (a CYP3A substrate) to a transplant patient with extremely low or high
CYP3A is more likely to result in nephrotoxicity or graft organ rejection, respectively
(Turgeon, K. et al. (1992) Clin. Pharmacol. and Therap. 52: 471-480). Similarly,
administration of a standard dose of etoposide (a CYP3A substrate) to a cancer
patient with low or high CYP3A is more likely to result in hematologic or
gastrointestinal toxicities or treatment failure, respectively (Rodman, J.H. et al.
(1994) J. Clin. Oncol. 12: 2390-97). Significant interpatient differences, exceeding
30-fold in some patient populations, have been demonstrated in both CYP3A content
and its related catalytic activities among untreated human livers (Watkins, P.B.
(1995) Hepatology 22: 994-996). Variation in CYP3A activities reflects, in part, the
heterogeneous expression of the individual CYP3A family members.

Variation in Expression of Hepatic CYP3A5

Polymorphic expression of CYP3A5 mRNA (Schuetz, J.D. (1989) Arch. Biochem
Biophys. 274: 355-365) and CYP3A5 protein has been identified as one factor
contributing to individual variation in total CYP3A expression and, thus,
CYP3A-mediated metabolism of drugs (Wrighton, S.A. et al. (1989) Mol Pharmacol 36:
97-105; Wrighton, S.A. et al. (1990) Mol. Pharmacol 38: 207-213). About 75% of the
Caucasian population fail to express hepatic CYP3A5 mRNA or protein; these
individuals are defined as CYP3A5 non-expressors. Within the 25% of the population
defined as CYP3A5 hepatic expressors an additional level of variation in CYP3A5
expression exists. Among these individuals, the previously published content of
CYP3A5 protein varies from 2 to 220 pmol CYP3A5/mg microsomal protein, and
with respect to total CYP3A, ranges from <1% to as much as 100% (Aoyama, T. et
al. (1989) J. Biol Chem 264: 10388-10395; Wrighton, S.A. et al. (1990) Mol
Pharmacol 38: 207-213). The average content of CYP3A5 protein among expressors
is reported as only approximately 30% that of CYP3A4; however, the factors
controlling the expression of CYP3A5 are unknown.

CYP3A5 is the predominant extrahepatically expressed form of CYP3A

In contrast to human liver, the majority of human kidneys express CYP3A5 protein
(Schuetz, E.G. et al. (1992) Arch Biochem Biophys 294: 206-214), and a more recent
report found that all human kidneys express CYP3A5 mRNA and protein but in a
bimodal fashion: 25% express high levels of CYP3A5; 75% express low levels of
CYP3A5 (Haehner, B.O. et al. (1995) ISSX Proceedings 8:352 (Abstr)). CYP3A5
mRNA is also reported to be expressed in human anterior pituitaries (Murray, G.I.
(1995) FEBS Lett 364: 79-82). In contrast, the dominant hepatic form, CYP3A4, is
only infrequently expressed in these extrahepatic tissues. These findings demonstrate
that CYP3A5 is the predominant if not exclusive CYP3A family member expressed
in most human extrahepatic tissues.

In order to understand the basis for the variable and polymorphic expression of
CYP3A5 in the liver and the factors governing its expression in extrahepatic tissue, it
is necessary and desired to identify the molecular mechanisms regulating its
expression. The tissue-specific and developmental regulation of CYP3A5 suggests
that the basis of the hepatic polymorphism lies in the regulation of its expression,
rather than in the CYP3A5 structural gene. Hashimoto et al. have reported the gene
structure of CYP3A4 and its transcriptional control (Hashimoto, H. et al. (1993) Eur.
J. Biochem 218: 585-595). A unique element regulating hormonal responsiveness
(dexamethasone) of CYP3A5 was reported by Schuetz et al. (Schuetz, J.D. et al.
(1996) Mol Pharmac 49: 63-72). Nonetheless, little, if anything, is known about the
factors controlling transcriptional activation.

Given the significant and important relevance of interpatient variation in drug 
metabolism to the success, choice, toxicity, and dosage of therapeutic drugs, the 
ability to monitor and predict drug metabolism or responsiveness is paramount. A 
knowledge of, and the ability to predict a patient's drug metabolism and appropriate 
therapeutic index, by determining the interindividual differences in expression of
CYP3A, including CYP3A5 and CYP3A4, is therefore needed. 

The citation of references herein shall not be construed as an admission that such is 
prior art to the present invention. 

SUMMARY OF THE INVENTION

The present invention is based upon the identification of polymorphisms that account
for variable levels of expression of the CYP3A5 protein among the human
population, which in turn accounts for variable metabolism of CYP3A5-metabolized
drugs among the human population. Such polymorphisms occur in intron 3 and exon
7 of the CYP3A5 gene.

The present invention extends to diagnostic assays, kits and methods for determining
the cytochrome P450 3A5 (CYP3A5) genotype of a subject, thereby providing a
means to determine the expression or activity of cytochrome P450 3A5 (CYP3A5) in
a subject. This is particularly relevant in determining and assessing interpatient
variation in drug metabolism. Specifically, in evaluating or determining appropriate
drug therapy in patients undergoing cancer treatment or on immunosuppressant
therapy, for instance, the determination of cytochrome P450 3A5 (CYP3A5)
genotype, and thereby predicting the expression of cytochrome P450 3A5 (CYP3A5)
in organs or tissues, is relevant and useful. It may also be useful in the determination
of the individual predisposition to cancers, particularly those caused by environmental
carcinogens.

The present invention provides sequence variations in the CYP3A5 gene that are
responsible for hepatic polymorphism in cytochrome P450 3A5 (CYP3A5)
expression, whereby the encoded CYP3A5 polypeptide is prematurely terminated.
The invention provides sequence variations in the CYP3A5 gene, particularly in
intron sequences, specifically intron 3, whereby the encoded CYP3A5 polypeptide is
prematurely terminated. In addition, the invention provides exonic sequence
variations in the CYP3A5 gene, particularly in exon 7, whereby the encoded
CYP3A5 polypeptide is prematurely terminated. Determination of the CYP3A5
sequence, and particularly of the intron 3 sequence genotype of an individual and/or
of the exon 7 sequence genotype of an individual, can be utilized in predicting the
expression or activity of cytochrome P450 3A5 (CYP3A5) in a subject, thereby
determining and assessing inter-individual variation in drug metabolism.

In particular, a polymorphism is identified in intron 3 of the CYP3A5 gene at the
position corresponding to nucleotide 22,893 of Genbank sequence accession no.
AC005020 and in exon 7 of the CYP3A5 gene at the position corresponding to
nucleotide 30,597 of Genbank sequence accession no. AC005020.

A sequence variation in the CYP3A5 gene wherein an Adenine (A) in intron 3 at the
position corresponding to nucleotide 22,893 of Genbank sequence accession no.
AC005020 is altered to a Guanine (G) in this same position is one of the
polymorphisms identified herein. This polymorphism leads to the expression of a
truncated CYP3A5 protein with little or no catalytic activity.



The invention further provides a sequence variation in the CYP3A5 gene wherein a
Guanine (G) in exon 7 at the position corresponding to nucleotide 30,597 of
Genbank sequence accession no. AC005020 is altered to an Adenine (A) in this same
position. This polymorphism leads to expression of an altered CYP3A5 with low
catalytic activity that is missing most of the amino acids encoded by exon 7.

According to the present invention, the presence of either the A to G polymorphism 
at the position corresponding to nucleotide 22,893 of Genbank sequence accession 
no. AC005020 or the G to A polymorphism at the position corresponding to 
nucleotide 30,597 of Genbank sequence accession no. AC005020 in the CYP3A5 
gene is predictive of reduced CYP3A5 expression. 
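
The prediction rule stated in the preceding paragraph can be sketched in Python as follows. This is an illustration only and not part of the disclosed methods: the function names, the dictionary layout and the two-letter genotype strings (one base per chromosome copy) are assumptions made for the example; the positions and base changes are those given above.

```python
# Reference and variant bases at the two positions described in the text
# (numbering per GenBank accession AC005020).
POLYMORPHISMS = {
    22893: {"reference": "A", "variant": "G"},  # intron 3
    30597: {"reference": "G", "variant": "A"},  # exon 7
}


def variants_present(genotype):
    """Return the sorted positions at which a variant base is observed.

    `genotype` maps a position to the bases called on the two chromosome
    copies, e.g. {22893: "AG", 30597: "GG"} (a hypothetical input format).
    """
    found = []
    for position, bases in genotype.items():
        if position not in POLYMORPHISMS:
            raise KeyError(f"unknown position: {position}")
        if POLYMORPHISMS[position]["variant"] in bases.upper():
            found.append(position)
    return sorted(found)


def predicts_reduced_expression(genotype):
    """True if either polymorphism is present, following the rule in the text."""
    return bool(variants_present(genotype))
```

The sketch reports only whether a variant base is present; it makes no claim about how zygosity affects the level of expression.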

In one aspect of the invention, methods for determining the genotype of a subject
with respect to these polymorphisms are provided. Diagnostic assays and kits based
on these methods are also provided. These methods may be used to diagnose and
assess the ability of a subject to metabolize drugs that are metabolized by the
CYP3A5 protein, as well as to assess interpatient variation in the metabolism of such
drugs.

BRIEF DESCRIPTION OF THE DRAWINGS



FIGURE 1 depicts splice variants of CYP3A5. Schematic drawing of the cDNA
structures of the various CYP3A5 alleles. Exons are numbered and boxed, and
primers P11 and P6 for exons 2 through 8 are indicated. The locations of exons 3B,
4B and 5B, which are derived from intron sequences, are indicated as darkened
boxes.

FIGURE 2 depicts the relationship between CYP3A5 phenotype and CYP3A5
genotype in livers of Caucasians (a) and African Americans (b). Microsomal content
of CYP3A5 protein in human livers (HL) was determined by immunoblot analysis.
Identification of the CYP3AP1 and CYP3A5 genotypes was made by direct
sequencing of genomic DNA; the CYP3AP1*1/*1, CYP3AP1*1/*3 and
CYP3AP1*3/*3 genotypes are GG, AG and AA, respectively, at nt -44 of the CYP3AP1
promoter. The CYP3A5 genotypes are defined in FIGURE 1.

FIGURE 3 depicts a mismatch primer for CYP3A5*3 versus CYP3A5*1 genotyping.
The mismatch primer generates a Tru9 I/Mse I restriction site on amplification of
CYP3A5*1 genotype nucleic acid.

FIGURE 4 depicts the strategy for mismatch directed PCR for CYP3A5*3 versus
CYP3A5*1 genotyping. With Tru9 I/Mse I cleavage after PCR amplification, the
22,893 A genotype generates cleavage products of 314 and 20 bp, while the 22,893
G genotype generates a product of 334 bp.

FIGURE 5 depicts a mismatch primer for CYP3A5*6 versus CYP3A5*1 genotyping.
The mismatch primer generates a Tru9 I/Mse I restriction site on amplification of
CYP3A5*6 genotype nucleic acid.

FIGURE 6 depicts the strategy for mismatch directed PCR for CYP3A5*6 versus
CYP3A5*1 genotyping. With Tru9 I/Mse I cleavage after PCR amplification, the
30,597 A genotype generates cleavage products of 151 and 26 bp, while the 30,597 
G genotype generates a product of 177 bp. 
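
As an illustration only, the fragment sizes given for FIGURES 4 and 6 can be turned into a small lookup that reads a digest result. The function name, the table layout and the input format are assumptions made for the example. Two further assumptions go beyond the figure descriptions: a heterozygous sample is taken to show both the uncut and the cleaved products, and only the uncut product and the larger cleavage product are consulted.

```python
# Expected Tru9 I/Mse I fragment sizes (bp) as described for FIGURES 4 and 6.
# In both assays the A genotype is cleaved and the G genotype is not.
ASSAYS = {
    22893: {"uncut": 334, "cut": (314, 20), "cut_base": "A", "uncut_base": "G"},
    30597: {"uncut": 177, "cut": (151, 26), "cut_base": "A", "uncut_base": "G"},
}


def call_genotype(position, observed_fragments):
    """Infer the two-base genotype at `position` from observed fragment sizes.

    The small 20 or 26 bp fragment is ignored because it may be hard to
    resolve on a gel (an assumption of this sketch, not a statement from
    the text).
    """
    spec = ASSAYS[position]
    sizes = set(observed_fragments)
    has_uncut = spec["uncut"] in sizes
    has_cut = spec["cut"][0] in sizes
    if has_cut and has_uncut:
        return spec["cut_base"] + spec["uncut_base"]  # assumed heterozygous
    if has_cut:
        return spec["cut_base"] * 2  # restriction site on both copies
    if has_uncut:
        return spec["uncut_base"] * 2  # restriction site on neither copy
    raise ValueError("no expected fragment observed")
```

Under these assumptions, a digest at position 22,893 that shows only the 334 bp product would be read as G/G, and one that shows the 314 bp product alone would be read as A/A.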

DETAILED DESCRIPTION



In accordance with the present invention there may be employed conventional
molecular biology, microbiology, and recombinant DNA techniques within the skill
of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook
et al., "Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in
Molecular Biology" Volumes I-III [Ausubel, F. M., ed. (1994)]; "Cell Biology: A
Laboratory Handbook" Volumes I-III [J. E. Celis, ed. (1994)]; "Current Protocols in
Immunology" Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide
Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames &
S.J. Higgins eds. (1985)]; "Transcription And Translation" [B.D. Hames & S.J.
Higgins, eds. (1984)]; "Animal Cell Culture" [R.I. Freshney, ed. (1986)];
"Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A Practical
Guide To Molecular Cloning" (1984).

Therefore, if appearing herein, the following terms shall have the definitions set out
below.

The terms "cytochrome P450 3A5", "CYP3A5", "CYP450 3A5", "3A5" and any
variants not specifically listed, may be used herein interchangeably, and as used
throughout the present application and claims refer to proteins, polypeptides and
enzymes, and extend to those heme proteins capable of metabolizing drugs and
other chemicals or agents, including as provided and detailed herein particularly in
Table 1 and Table 2. Accordingly, proteins displaying substantially equivalent or
altered activity are likewise contemplated. These modifications may be deliberate,
for example, such as modifications obtained through site-directed mutagenesis, or
may be accidental, such as those obtained through mutations in hosts that are
producers of the enzyme. Also, the terms "cytochrome P450 3A5", "CYP3A5",
"CYP450 3A5", "3A5" are intended to include within their scope proteins
specifically recited herein as well as all substantially homologous analogs and allelic
variations.

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions
as an autonomous unit of DNA replication in vivo; i.e., capable of replication under
its own control.

A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA
segment may be attached so as to bring about the replication of the attached segment.



A "DNA molecule" refers to the polymeric form of deoxyribonucleotides (adenine,
guanine, thymine, or cytosine) in either its single-stranded form or a double-stranded
helix. This term refers only to the primary and secondary structure of the molecule,
and does not limit it to any particular tertiary forms. Thus, this term includes
double-stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction
fragments), viruses, plasmids, and chromosomes. In discussing the structure of
particular double-stranded DNA molecules, sequences may be described herein
according to the normal convention of giving only the sequence in the 5' to 3'
direction along the non-transcribed strand of DNA (i.e., the strand having a sequence
homologous to the mRNA).

An "origin of replication" refers to those DNA sequences that participate in DNA
synthesis.

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed
and translated into a polypeptide in vivo when placed under the control of
appropriate regulatory sequences. The boundaries of the coding sequence are
determined by a start codon at the 5' (amino) terminus and a translation stop codon at
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to,
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences
from eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. A
polyadenylation signal and transcription termination sequence will usually be located 
3' to the coding sequence. 
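
The way a start codon and a stop codon bound a coding sequence can be sketched as follows. This is a minimal illustration, not a method from the present disclosure: the function name is hypothetical, and the sketch assumes the standard ATG start codon and the TAA, TAG and TGA stop codons read on the strand given 5' to 3', with no introns.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequence(dna):
    """Return (start, end) of the first ATG-to-stop open reading frame.

    Indices are 0-based; `end` is exclusive and includes the stop codon.
    Returns None if no start codon with an in-frame stop codon is found.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Step through the codons that follow this start codon, in frame.
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return start, i + 3
        start = dna.find("ATG", start + 1)
    return None
```

For the toy input "CCATGGCTTAAGG", the function returns (2, 11), which spans ATG GCT TAA.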

The terms "CYP3A5", "CYP450 3A5", "3A5", "CYP3A5*1", "CYP3A5*3",
"CYP3A5*6", "CYP3A5*1B", "CYP3A5*1C", and any variants not specifically
listed, and as used throughout the present application and claims refer to genes,
nucleic acid, alleles, allelic variants and DNA, including exonic and intronic
sequences, capable of, or responsible for, encoding or expressing a protein,
polypeptide and enzyme cytochrome P450 3A5. The cytochrome P450 3A5 may be
expressed at a high (expressor) or very low (non-expressor) relative level or amount,
or when expressed may be active (expressor) or inactive (non-expressor), and may be
active in metabolizing drugs and other chemicals or other agents, depending on the
gene, allele or allelic variant. The cytochrome P450 3A5 expressed or encoded by
the above termed gene, allele or allelic variant may be prematurely terminated
(generally inactive), depending on the gene, allele or allelic variant.

A "nucleic acid molecule" or "nucleic acid" refers to the phosphate ester polymeric
form of ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA
molecules") or deoxyribonucleosides (deoxyadenosine, deoxyguanosine,
deoxythymidine, or deoxycytidine; "DNA molecules") in either single-stranded
form, or a double-stranded helix. Ribonucleic acid or RNA, particularly of the form
of messenger RNA or mRNA, which is translated into polypeptide(s), and
deoxyribonucleic acid or DNA can be readily isolated from cells, tissues, blood,
etc. of individuals using methods readily known to the skilled artisan.
Double-stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The term
nucleic acid molecule, and in particular DNA or RNA molecule, refers only to the
primary and secondary structure of the molecule, and does not limit it to any
particular tertiary or quaternary forms. Thus, this term includes double-stranded
DNA found, inter alia, in linear or circular DNA molecules (e.g., restriction
fragments), plasmids, and chromosomes. In discussing the structure of particular
double-stranded DNA molecules, sequences may be described herein according to
the normal convention of giving only the sequence in the 5' to 3' direction along the
nontranscribed strand of DNA (i.e., the strand having a sequence homologous to the
mRNA). 
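
The strand convention described above can be illustrated with a short sketch. The function names are hypothetical and the example sequence is arbitrary; the sketch only shows that the nontranscribed strand reads like the mRNA (with U in place of T) and that the transcribed strand is its reverse complement.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def transcribed_strand(nontranscribed):
    """Reverse complement of the given strand, written 5' to 3'."""
    return nontranscribed.upper().translate(COMPLEMENT)[::-1]


def mrna_from_nontranscribed(nontranscribed):
    """The mRNA has the same sequence as the nontranscribed strand, with U for T."""
    return nontranscribed.upper().replace("T", "U")
```

For the arbitrary input "ATGGCT", the transcribed strand is "AGCCAT" and the corresponding mRNA is "AUGGCU".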

Transcriptional and translational control sequences are DNA regulatory sequences,
such as promoters, enhancers, polyadenylation signals, terminators, and the like, that 
provide for the expression of a coding sequence in a host cell. 

An "upstream regulatory region" is a DNA regulatory region capable of binding
RNA polymerase in a cell and initiating transcription of a downstream (3' direction)
coding sequence. For purposes of defining the present invention, the upstream
regulatory region sequence is bounded at its 3' terminus by the transcription
initiation site and extends upstream (5' direction) to include the minimum number of
bases or elements necessary to initiate transcription at levels detectable above
background and under appropriate regulatory control. Within the upstream
regulatory region sequence will be found a transcription initiation site (conveniently
defined by mapping with nuclease S1), as well as protein binding domains
(consensus sequences) responsible for the binding of RNA polymerase and
regulatory regions (consensus sequences) responsible for appropriate regulatory
control, including cellular expression, induction of expression, etc. Eukaryotic
promoters will often, but not always, contain "TATA" boxes and "CAT" or "CATA"
boxes.

An "expression control sequence" is a DNA sequence that controls and regulates the
transcription and translation of another DNA sequence. A coding sequence is "under 
the control" of transcriptional and translational control sequences in a cell when 
RNA polymerase transcribes the coding sequence into mRNA, which is then 
translated into the protein encoded by the coding sequence. 

The term "oligonucleotide," as used herein in referring to the probe or primer of the
present invention, is defined as a molecule comprised of two or more
ribonucleotides, preferably more than three. Its exact size will depend upon many
factors which, in turn, depend upon the ultimate function and use of the
oligonucleotide. The oligonucleotides of the invention are preferably from 10 to 50
nucleotides in length, even more preferably from 20-30 nucleotides in length or from
15-25 nucleotides in length, and may be DNA, RNA or synthetic nucleic acid, and
may be chemically or biochemically modified or may contain non-natural or
derivatized nucleotide bases, as will be appreciated by those skilled in the art. Also
included are synthetic molecules that mimic polynucleotides in their ability to bind
to a designated sequence to form a stable hybrid. Such molecules are known in the
art and include, for example, peptide nucleic acids (PNAs) in which peptide linkages
substitute for phosphate linkages in the backbone of the molecule.

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
naturally as in a purified restriction digest or produced synthetically, which is 
capable of acting as a point of initiation of synthesis when placed under conditions in 
which synthesis of a primer extension product, which is complementary to a nucleic 
acid strand, is induced, i.e., in the presence of nucleotides and an inducing agent 
such as a DNA polymerase and at a suitable temperature and pH. The primer may be 
either single-stranded or double-stranded and must be sufficiently long to prime the 
synthesis of the desired extension product in the presence of the inducing agent. The 
exact length of the primer will depend upon many factors, including temperature, 
source of primer and use of the method. For example, for diagnostic applications, 
depending on the complexity of the target sequence, the oligonucleotide primer 
typically contains 10 or more nucleotides, preferably 15-25 nucleotides, although it 
may contain fewer nucleotides or more nucleotides. 

The primers herein are selected to be "substantially" complementary to different 
strands of a particular target DNA sequence. This means that the primers must be 
sufficiently complementary to hybridize with their respective strands. Therefore, the 
primer sequence need not reflect the exact sequence of the template. For example, a 
non-complementary nucleotide fragment may be attached to the 5' end of the primer, 
with the remainder of the primer sequence being complementary to the strand. 
Alternatively, non-complementary bases or longer sequences can be interspersed into 
the primer, provided that the primer sequence has sufficient complementarity with 
the sequence of the strand to hybridize therewith and thereby form the template for 
the synthesis of the extension product. 

As used herein, the terms "restriction endonucleases" and "restriction enzymes" refer 
to bacterial enzymes, each of which cuts double-stranded DNA at or near a specific 
nucleotide sequence. 

Two DNA sequences are "substantially homologous" when at least about 75% 
(preferably at least about 80%, and most preferably at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that 
are substantially homologous can be identified by comparing the sequences using 
standard software available in sequence data banks, or in a Southern hybridization 
experiment under, for example, stringent conditions as defined for that particular 
system. Defining appropriate hybridization conditions is within the skill of the art. 
See, e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid 
Hybridization, supra. 
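
By way of illustration only, the percentage of matching nucleotides over a defined 
length may be computed as sketched below. The sketch assumes that the two 
sequences have already been aligned to the same length. The function names 
percent_identity and substantially_homologous are hypothetical, and the default 
threshold of 75% is simply the lower bound recited in the definition above. 

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Return the percentage of positions at which two pre-aligned,
    equal-length sequences carry the same nucleotide."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same defined length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def substantially_homologous(seq_a: str, seq_b: str, threshold: float = 75.0) -> bool:
    """Apply the 'at least about 75%' criterion (threshold is an
    illustrative default; 80, 90 or 95 may be passed instead)."""
    return percent_identity(seq_a, seq_b) >= threshold
```

A stricter embodiment can be modeled by passing threshold=90.0 or threshold=95.0. 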

Two amino acid sequences are "substantially homologous" when at least about 70% 
of the amino acid residues (preferably at least about 80%, and most preferably at 
least about 90 or 95%) are identical, or represent conservative substitutions. 

A "heterologous" region of the DNA construct is an identifiable segment of DNA 
within a larger DNA molecule that is not found in association with the larger 
molecule in nature. Thus, when the heterologous region encodes a mammalian gene, 
the gene will usually be flanked by DNA that does not flank the mammalian 
genomic DNA in the genome of the source organism. Another example of a 
heterologous coding sequence is a construct where the coding sequence itself is not 
found in nature (e.g., a cDNA where the genomic coding sequence contains introns, 
or synthetic sequences having codons different than the native gene). Allelic 
variations or naturally-occurring mutational events do not give rise to a heterologous 
region of DNA as defined herein. 

A labeled oligonucleotide or primer may be utilized in the methods, assays and kits 
of the present invention. The labeled oligonucleotide may be utilized as a primer in 
PCR or other method of amplification and may be utilized in analysis, as a reactant or 
binding partner of the resulting amplified product. In certain methods, where 
sufficient concentration or sequestration of the CYP3A5 nucleic acid has occurred, 
and wherein the oligonucleotide label and methods utilized are appropriately and 
sufficiently sensitive, the nucleic acid may be directly analyzed, with the presence of 
a label, or of a particular label, being indicative of the result and diagnostic of the 
CYP3A5 genotype. After the labeled oligonucleotide or primer has had an 
opportunity to react with sites within the sample, the resulting product may be 
examined by known techniques, which may vary with the nature of the label 
attached. The label utilized may be radioactive or non-radioactive, including 
fluorescent, colorimetric or enzymatic. In addition, the label may be, for instance, a 
physical or antigenic tag which is characterized by its activity or binding. 

In the instance where a radioactive label, such as the isotopes ³H, ¹⁴C, ³²P, ³⁵S, ³⁶Cl, 
⁵¹Cr, ⁵⁷Co, ⁵⁸Co, ⁵⁹Fe, ⁹⁰Y, ¹²⁵I, ¹³¹I, and ¹⁸⁶Re, is used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
detection may be accomplished by any of the presently utilized colorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
techniques known in the art. 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, 
that binds a specific epitope. The term encompasses polyclonal, monoclonal, and 
chimeric antibodies, the last mentioned described in further detail in U.S. Patent 
Nos. 4,816,397 and 4,816,567. 

An "antibody combining site" is that structural portion of an antibody molecule 
comprised of heavy and light chain variable and hypervariable regions that 
specifically binds antigen. 

The phrase "antibody molecule" in its various grammatical forms as used herein 
contemplates both an intact immunoglobulin molecule and an immunologically 
active portion of an immunoglobulin molecule. Exemplary antibody molecules are 
intact immunoglobulin molecules, substantially intact immunoglobulin molecules 
and those portions of an immunoglobulin molecule that contain the paratope, 
including those portions known in the art as Fab, Fab', F(ab')2 and F(v), which 
portions are preferred for use in the therapeutic methods described herein. Fab and 
F(ab')2 portions of antibody molecules can be prepared by the proteolytic reaction of 
papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Patent No. 4,342,566 to 
Theofilopoulos et al. Fab' antibody molecule portions are also well-known and are 
produced from F(ab')2 portions followed by reduction of the disulfide bonds linking 
the two heavy chain portions as with mercaptoethanol, and followed by alkylation of 
the resulting protein mercaptan with a reagent such as iodoacetamide. An antibody 
containing intact antibody molecules is preferred herein. 

The phrase "monoclonal antibody" in its various grammatical forms refers to an 
antibody having only one species of antibody combining site capable of 
immunoreacting with a particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which it immunoreacts. A 
monoclonal antibody may therefore contain an antibody molecule having a plurality 
of antibody combining sites, each immunospecific for a different antigen; e.g., a 
bispecific (chimeric) monoclonal antibody. 

A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation 
of that DNA sequence. The term "operatively linked" includes having an appropriate 
start signal (e.g., ATG) in front of the DNA sequence to be expressed and 
maintaining the correct reading frame to permit expression of the DNA sequence 
under the control of the expression control sequence and production of the desired 
product encoded by the DNA sequence. If a gene that one desires to insert into a 
recombinant DNA molecule does not contain an appropriate start signal, such a start 
signal can be inserted in front of the gene. 

The term "standard hybridization conditions" refers to salt and temperature 
conditions substantially equivalent to 5 x SSC and 65 °C for both hybridization and 
wash. However, one skilled in the art will appreciate that such "standard 
hybridization conditions" are dependent on particular conditions including the 
concentration of sodium and magnesium in the buffer, nucleotide sequence length 
and concentration, percent mismatch, percent formamide, and the like. Also 
important in the determination of "standard hybridization conditions" is whether the 
two sequences hybridizing are RNA-RNA, DNA-DNA or RNA-DNA. Such 
standard hybridization conditions are easily determined by one skilled in the art 
according to well known formulae, wherein hybridization is typically 10-20°C below 
the predicted or determined Tm, with washes of higher stringency, if desired. 
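
As a non-limiting sketch of one such formula, the following estimates Tm for a 
DNA-DNA hybrid and derives a hybridization temperature window 10-20°C below it. 
The empirical relation used here (81.5 + 16.6 log10[Na+] + 0.41 (%G+C) - 0.61 
(% formamide) - 500/length) is an assumption chosen for illustration; it is 
commonly applied to DNA-DNA hybrids longer than about 50 nucleotides, and other 
relations apply to RNA-RNA or RNA-DNA hybrids and to short oligonucleotides. The 
function names and parameters are hypothetical. 

```python
import math


def estimate_tm_celsius(sequence: str, sodium_molar: float,
                        percent_formamide: float = 0.0) -> float:
    """Estimate Tm (deg C) of a DNA-DNA hybrid with a common empirical
    approximation (assumed here; not taken from the specification)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    if sodium_molar <= 0:
        raise ValueError("sodium concentration must be positive")
    length = len(sequence)
    gc_count = sum(1 for base in sequence.upper() if base in "GC")
    percent_gc = 100.0 * gc_count / length
    return (81.5
            + 16.6 * math.log10(sodium_molar)
            + 0.41 * percent_gc
            - 0.61 * percent_formamide
            - 500.0 / length)


def hybridization_window(tm_celsius: float) -> tuple:
    """Return (lowest, highest) hybridization temperature, taken as
    20 and 10 deg C below Tm respectively."""
    return (tm_celsius - 20.0, tm_celsius - 10.0)
```

Under these assumptions, a 100-nucleotide hybrid with 50% G+C content in 1.0 M 
sodium and no formamide has an estimated Tm of about 97°C, giving a hybridization 
window of about 77-87°C. 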

In its broadest aspect, the present invention extends to diagnostic assays, kits and 
methods for determining the cytochrome P450 3A5 (CYP3A5) genotype of a 
subject, thereby providing a means to determine the expression or activity of 
cytochrome P450 3A5 (CYP3A5) in a subject. This is particularly relevant in 
determining and assessing interpatient variation in drug metabolism. Specifically, in 
evaluating or determining appropriate drug therapy in patients undergoing cancer 
treatment or on immunosuppressant therapy, for instance, the determination of 
cytochrome P450 3A5 (CYP3A5) genotype, and thereby predicting the expression 
of cytochrome P450 3A5 (CYP3A5) in organs or tissues, is relevant and useful. 

As provided herein, it has now been recognized that sequence variations in the 
CYP3A5 gene, whereby the encoded CYP3A5 polypeptide is prematurely 
terminated, are responsible for hepatic polymorphism in cytochrome P450 3A5 
(CYP3A5) expression. It has now been recognized that sequence variations in the 
CYP3A5 gene, particularly in intron sequences, specifically intron 3, whereby the 
encoded CYP3A5 polypeptide is prematurely terminated, are responsible for hepatic 
polymorphism in cytochrome P450 3A5 (CYP3A5) expression. In addition, it has 
now been recognized that sequence variations in the CYP3A5 gene, particularly in 
exonic sequences, specifically exon 7, whereby the encoded CYP3A5 polypeptide is 
prematurely terminated, are additionally responsible for hepatic polymorphism in 
cytochrome P450 3A5 (CYP3A5) expression. Therefore, determination of the 
CYP3A5 sequence, and particularly of the intron 3 sequence genotype of an 
individual and/or of the exon 7 sequence genotype of an individual, can be utilized in 
predicting the expression or activity of cytochrome P450 3A5 (CYP3A5) in a 
subject, thereby determining and assessing inter-individual variation in drug 
metabolism. 

Assays and methods have been developed and are provided and described herein for 
determination of the CYP3A5 sequence, particularly of the intron 3 sequence, and 
most particularly of the nucleotide 22,893 (relative to Genbank AC005020) genotype 
of an individual. Assays and methods have been developed and are provided and 
described herein for determination of the CYP3A5 sequence, particularly of the exon 
7 sequence, and most particularly of the nucleotide 30,597 (relative to Genbank 
AC005020) genotype of an individual. 
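
Purely as an illustrative sketch, the bases observed at nucleotide 22,893 and 
nucleotide 30,597 on a single chromosome may be mapped to an allele designation as 
follows. The mapping is an assumption made for this example: A at 22,893 together 
with G at 30,597 is treated as CYP3A5*1, G at 22,893 as CYP3A5*3, and A at 30,597 
as CYP3A5*6. The function names are hypothetical, and the sketch assumes that the 
two positions have been phased onto the same chromosome. 

```python
def call_allele(base_22893: str, base_30597: str) -> str:
    """Map the bases at the intron 3 and exon 7 positions of one
    chromosome to an allele name (mapping is an assumption)."""
    intron3 = base_22893.upper()
    exon7 = base_30597.upper()
    if intron3 not in ("A", "G"):
        raise ValueError("expected A or G at nucleotide 22,893")
    if exon7 not in ("G", "A"):
        raise ValueError("expected G or A at nucleotide 30,597")
    # If both variant bases are present, the intron 3 variant is
    # reported first; this tie-break is an arbitrary choice.
    if intron3 == "G":
        return "CYP3A5*3"
    if exon7 == "A":
        return "CYP3A5*6"
    return "CYP3A5*1"


def call_genotype(haplotypes) -> str:
    """Combine per-chromosome calls, e.g. [("A", "G"), ("G", "G")],
    into a genotype string with allele names in sorted order."""
    return "/".join(sorted(call_allele(b3, b7) for b3, b7 in haplotypes))
```

For example, call_genotype([("A", "G"), ("G", "G")]) yields "CYP3A5*1/CYP3A5*3" 
under the stated mapping. 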

In particular, assays and methods utilizing polymerase chain reaction (PCR) for 
amplification and assessment of the CYP3A5 gene or RNA are provided. Following 
amplification of the CYP3A5 sequence region, assessment may include, but is not 
limited to, direct visualization of the product (particularly wherein allele specific 
oligonucleotide primers are utilized), sequencing, and restriction enzyme cleavage 
and analysis. 

The assays and methods of the present invention broadly and generally include and 
incorporate the following steps in determining the cytochrome P450 3A5 (CYP3A5) 
genotype and phenotype of an individual: (a) isolation of nucleic acid from the 
individual; (b) amplification of at least the intron 3 cytochrome P450 3A5 (CYP3A5) 
sequence; and (c) analysis of the cytochrome P450 3A5 (CYP3A5) sequence. 

The assays and methods of the present invention broadly and generally include and 
incorporate the following steps in determining the cytochrome P450 3A5 (CYP3A5) 
genotype and phenotype of an individual: (a) isolation of nucleic acid from the 
individual; (b) amplification of the cytochrome P450 3A5 (CYP3A5) sequence 
including intron 3, intron 4 and intron 5; and (c) analysis of the cytochrome P450 
3A5 (CYP3A5) sequence. 

The assays and methods of the present invention broadly and generally include and 
incorporate the following steps in determining the cytochrome P450 3A5 (CYP3A5) 
genotype and phenotype of an individual: (a) isolation of nucleic acid from the 
individual; (b) amplification of at least the exon 7 cytochrome P450 3A5 (CYP3A5) 
sequence; and (c) analysis of the cytochrome P450 3A5 (CYP3A5) sequence. 

In practicing the assays and methods of the present invention, it is necessary to 
perform a step (a) to obtain, purify or otherwise isolate nucleic acid, DNA or mRNA 
for analysis. The term "isolation", "isolating" or "isolate" as used herein, and as 
applied to the methods and assays described herein, refers to and encompasses any 
method or approach known in the art whereby DNA or RNA can be obtained, 
procured, prepared, purified or isolated such that it is suitable for analysis, 
amplification, restriction enzyme cleavage and/or sequencing as provided in the 
methods and assays of the present invention. Various methods for the isolation or 
procurement of nucleic acid may be employed, as any skilled artisan may know and 
practice. Such methods may include methods employed for the isolation of genomic 
DNA or mRNA in various forms and states of purity and may not necessarily 
involve or require the separation of DNA or RNA from all cellular debris, protein, 
etc. The term isolation as used herein is contemplated to include the preparation of 
cell or tissue samples whereby DNA or RNA may be analyzed, amplified, etc. in 
situ. In the event mRNA is utilized, a first copy of DNA may be generated 
therefrom, for instance by reverse transcription using e.g. reverse transcriptase (RT), 
followed by amplification of the DNA copy or cDNA. 

The step (b) may be performed utilizing any method of amplification, including 
polymerase chain reaction (PCR), ligase chain reaction (Barany, F. (1991) Proc. 
Natl. Acad. Sci. 88:189-193), rolling circle amplification (Lizardi, P.M. et al (1998) 
Nature Genetics 19:225-232), strand displacement amplification (Walker, G.T. et al 
(1992) Proc. Natl. Acad. Sci. 89:392-396) or alternatively any means or method 
whereby concentration or sequestration of sufficient amounts of the cytochrome 
P450 3A5 (CYP3A5) nucleic acid for analysis may be obtained. The primers for use 
in amplification of at least the intron region and/or the exon 7 region of CYP3A5 
may be selected and utilized by the skilled artisan employing the sequence of 
cytochrome P450 3A5 (CYP3A5) as available at the National Center for 
Biotechnology Information (NCBI) (www.ncbi.nlm.nih.gov) as Genbank entry 
AC005020, the complete sequence of Homo sapiens BAC clone Gsl-259H13 
(Sulston, J.E. and Waterston, R. (1998) Genome Res. 8(11), 1097-1108). This 
particular sequence was utilized in the design and sequence of primers exemplified 
herein. In addition, the sequence of Genbank entry L26985 was published by 
Schuetz et al (Schuetz, J. et al (1995) Biochim Biophys Acta 1261:161-165). This 
sequence was originally described as a CYP3A5 pseudogene, but is actually a 
spliced variant mRNA, similar to the CYP3A5*3 allele product described herein. 
Particular exemplary primers are provided herein and include oligonucleotide 
primers having the sequence set out in SEQ ID NOS: 16, 24-27 and 30-32. Based on 
the sequence of the mutant alleles provided herein, PCR primers are constructed that 
are complementary to the region of the mutant allele encompassing the point 
mutation. A primer consists of a consecutive sequence of polynucleotides 
complementary to any region in the allele encompassing the position which is 
mutated in the mutant allele. The size of these amplification/PCR primers ranges 
anywhere from five bases to hundreds of bases. However, the preferred size of a 
primer is in the range from 10 to 50 bases, most preferably from 15 to 35 bases. As 
the size of the primer decreases so does the specificity of the primer for the targeted 
region. Hence, even though a primer which is less than five bases long will bind to 
the targeted region, it also has an increased chance of binding to other regions of the 
template polynucleotide which are not in the targeted region and do not contain the 
polymorphic/mutated base. Conversely, a larger primer provides for greater 
specificity; however, it becomes quite cumbersome to make and manipulate a very 
large fragment. Nevertheless, when necessary, large fragments are employed in the 
method of the present invention. To amplify the region of the genomic DNA of the 
individual patient, primers to one or both sides of the targeted position, for instance 
the third intron (intron 3) and particularly the A/G point mutation at nucleotide 
22,893 (relative to Genbank AC005020), and also the position in exon 7 and 
particularly the G/A point mutation at nucleotide 30,597 (relative to Genbank 
AC005020), are made and used in a PCR amplification reaction, using known 
methods in the art (e.g. Massachusetts General Hospital & Harvard Medical School, 
Current Protocols In Molecular Biology, Chapter 15 (Greene Publishing Associates 
and Wiley-Interscience 1991)) and as particularly exemplified herein. 
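
The relationship between primer length and specificity can be illustrated with a 
rough calculation of how many times a primer of a given length would be expected to 
match a template by chance. The sketch below assumes a template of uniformly random, 
equally frequent bases, which real genomic DNA is not, and counts exact matches only. 
The function name and the template length of 3,000,000,000 bases in the note are 
hypothetical values chosen for illustration. 

```python
def expected_random_matches(primer_length: int, template_length: int,
                            both_strands: bool = True) -> float:
    """Expected number of exact chance matches of a primer in a random
    template with equal base frequencies (a simplifying assumption)."""
    if primer_length < 1:
        raise ValueError("primer_length must be at least 1")
    if template_length < primer_length:
        raise ValueError("template must be at least as long as the primer")
    # Number of start positions at which the primer could anneal.
    positions = template_length - primer_length + 1
    strands = 2 if both_strands else 1
    # Each position matches with probability (1/4) ** primer_length.
    return strands * positions / 4 ** primer_length
```

Under these assumptions, a 10-base primer would be expected to match a 
3,000,000,000-base template thousands of times by chance, whereas a 20-base primer 
would be expected to match fewer than 0.01 times, consistent with the preference 
for primers of 15 to 35 bases. 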



The (c) analysis step will utilize skills and methods available to the skilled artisan for 
determining and distinguishing a sequence and can include: direct sequencing of the 
amplified or otherwise sequestered product; hybridization utilizing a labeled probe or 
labeled probe set including one specific for any single nucleotide polymorphism, 
including A/G at nucleotide 22,893 and G/A at nucleotide 30,597 as provided herein; 
direct visualization of the PCR product by gel separation or by the presence of a 
non-radioactive dye or fluorescent dye introduced with the primer, particularly wherein 
allele specific oligonucleotide primers are utilized (including fluorescence as 
provided by the molecular beacon technology (Tyagi, S. and Kramer, F. (1996) 
Nature Biotech 14:303-308; Tyagi, S. et al (1998) Nature Biotech 16:49-53)); 
restriction enzyme analysis wherein restriction enzyme cleavage is characteristic of 
the upstream regulatory region sequence; sequencing by hybridization, etc. 

In particular embodiments described herein, at least the intron 3 region of 
cytochrome P450 3A5 (CYP3A5) is amplified utilizing primers which amplify from 
5' or upstream of the intron 3 region, particularly 5' and 3' of the intronic region, and 
more particularly 5' and 3' of the nucleotide 22,893 point mutation. Various primers 
can be utilized, including the particular primer pairs exemplified herein (particularly 
primer pairs SEQ ID NO: 24 and 25; or primer pairs SEQ ID NO: 26 and 27). 

In particular embodiments described herein, at least the exon 7 region of cytochrome 
P450 3A5 (CYP3A5) is amplified utilizing primers which amplify from 5' or 
upstream of the exon 7 region, particularly 5' and 3' of the exon 7 region, and more 
particularly 5' and 3' of the nucleotide 30,597 point mutation. Various primers can 
be utilized, including the particular primer pairs exemplified herein (particularly 
primer pairs SEQ ID NO: 30 and 16; or primer pairs SEQ ID NO: 31 and 32). 

Following amplification, the PCR product may be sequenced, subjected to a second 
round of amplification, or otherwise analyzed in step (c). The sequence may be 
determined using any of various methods known in the art, including but not limited 
to traditional sequencing methodologies and more rapid and high throughput 
mini-sequencing or pyrosequencing, including but not limited to those exemplified in Cai 
et al, Sun et al and Ahmadian et al, which references are incorporated herein in their 
entirety by reference (Cai, H. et al (2000) Genomics 66(2):135-143; Sun, X. et al (2000) 
Nucleic Acids Res 28(12):E68; Ahmadian, A. et al (2000) Anal Biochem 
280(1):103-110). In utilizing certain of these particularly sensitive and efficient 
sequencing methodologies it may, in fact, not be necessary to perform the (b) 
amplification step, provided that suitable starting nucleic acid is isolated in (a) for 
analysis. By utilizing methods which do not require sequencing and whereby the 
single nucleotide polymorphism, particularly the nucleotide 22,893 (relative to 
Genbank AC005020) and nucleotide 30,597 (relative to Genbank AC005020) allelic 
sequences can be directly determined or inferred, one can rapidly screen genomic 
DNA from many individuals. A two step or two round amplification approach has 
been used successfully to examine polymorphisms in the CYP2D6 and in thiopurine 
methyltransferase (TPMT) genes (Evans et al. Pharmacogenetics, 1: 143-148, 1991). 
As exemplified herein, a nested PCR approach can be utilized. The first round of 
amplification is used to amplify the gene segment that may contain the mutation of 
interest. The second round of amplification can utilize one of the common primers 
as its first primer but importantly makes use of a second primer for either the 
wild-type sequence or for the mutation of interest, and using PCR conditions that 
produce sequence specific amplification. Alternatively, restriction enzyme digestion 
of the first or second round PCR product may be used to detect the presence or 
absence of a mutation, when a restriction site is either gained or lost. The second 
round of amplification may utilize an allele specific system, for instance, a mismatch 
directed primer, wherein a base change is specifically introduced by the primer, 
thereby generating a restriction site at or near the site of the point mutation in a 
particular allele. Further an allele specific system, for instance, a mismatch directed 
primer, may be used as one primer in a single round of amplification, wherein a base 
change is specifically introduced by the primer, thereby generating a restriction site 
at or near the site of the point mutation in a particular allele (this approach is 
exemplified herein in Example 3). Alternatively, an allele specific oligonucleotide, 
ligase chain reaction, etc. may be utilized so as to generate product only in the 
presence of a particular base or provide products which are distinguishable by dye, 
label, size, etc. in each case. 

Other rapid pharmacogenetic single nucleotide polymorphism (SNP) screening 
technologies which can be employed and are contemplated as suitable for step (c) 
currently exist and could be utilized by the skilled artisan to identify or characterize 
the CYP3A5*1, CYP3A5*3 and CYP3A5*6 alleles and particularly the nucleotide 
22,893 (relative to Genbank AC005020) and nucleotide 30,597 (relative to Genbank 
AC005020) SNPs. Various SNP detection methodologies are presently available or 
offered by commercial companies, including Aclara Biosciences, Orchid 
Biosciences, Qiagen Genomics, PPGX, and Affymetrix. Exemplary such SNP 
detection methodologies, particularly those of Orchid Biosciences, are provided in 
United States Patent Numbers 6,013,431, 6,004,744, 5,952,174 and 5,939,291, 
which are incorporated herein by reference in their entirety. Qiagen Genomics 
utilizes a Masscode system for SNP genotyping, whereby a mass spectrometer is 
utilized to image/detect Masscode tags attached to DNA molecules via a 
photochemical linker. A mass-tagged system for SNP detection is also provided by 
Fei and Smith (Fei, Z. and Smith, L.M. (2000) Rapid Comm Mass Spectrom 
14(11):950-959). Multiplex chip or flow cytometry systems for parallel genotyping 
may also be utilized, as described by Affymetrix and Axys Pharmaceuticals (Fan, 
J.B. et al (2000) Genome Res 10(6):853-860; Armstrong, B. et al (2000) Cytometry 
40(2):102-108). 

Assays are also contemplated wherein the single nucleotide polymorphism in the 
intron 3 and/or exon 7 region(s) of cytochrome P450 3A5 (CYP3A5) is determined 
by isolating nucleic acid from a subject and analysis of the single nucleotide 
polymorphism wherein the polymorphism generates a restriction site recognized by 
a unique restriction enzyme. Thereby, the genotype may be directly analyzed by 
restriction polymorphism analysis via gel electrophoresis or DNA probe 
hybridization and characterization of DNA fragments. 
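
To illustrate how a gained or lost restriction site translates into distinguishable 
fragment sizes, the following sketch performs an in silico digest of an amplified 
fragment. It uses the Mse I (Tru9 I) recognition sequence TTAA, cleaved after the 
first T, as the default site. The function name is hypothetical, and the short 
sequences in the note are toy examples rather than CYP3A5 sequence. 

```python
def digest_fragment_lengths(amplicon: str, site: str = "TTAA",
                            cut_offset: int = 1) -> list:
    """Return the lengths of the fragments produced when a linear
    amplicon is cut at every occurrence of the recognition site.
    cut_offset is the number of bases into the site where the top
    strand is cleaved (1 for T^TAA)."""
    amplicon = amplicon.upper()
    site = site.upper()
    cut_positions = []
    start = amplicon.find(site)
    while start != -1:
        cut_positions.append(start + cut_offset)
        # Resume one base later so overlapping sites are also found.
        start = amplicon.find(site, start + 1)
    boundaries = [0] + cut_positions + [len(amplicon)]
    return [end - begin for begin, end in zip(boundaries, boundaries[1:])]
```

For the toy fragment "GGGGTTAACCCC" the function returns two fragments of 5 and 7 
bases, whereas "GGGGTTGACCCC", which lacks the site, returns a single uncut 
fragment of 12 bases. On a gel, the two outcomes would be distinguished by band 
size. 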

In particular, the present invention provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) genotype of a subject which comprises 

(a) isolating nucleic acid from said subject; and 

(b) characterizing intron 3 of the cytochrome P450 3A5 (CYP3A5) sequence, thereby 
determining the cytochrome P450 3A5 (CYP3A5) intron 3 genotype of said subject. 

In particular, the present invention provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) genotype of a subject which comprises 

(a) isolating nucleic acid from said subject; and 

(b) characterizing exon 7 of the cytochrome P450 3A5 (CYP3A5) sequence, thereby 
determining the cytochrome P450 3A5 (CYP3A5) exon 7 genotype of said subject. 

The present invention further provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) genotype of a subject which comprises 

(a) isolating nucleic acid from said subject; 

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic 
acid, which includes at least intron 3, thereby obtaining an amplified fragment; and 

(c) sequencing the amplified fragment obtained in step (b), thereby determining the 
cytochrome P450 3A5 (CYP3A5) intron 3 genotype of said subject. 



In a further embodiment, the invention provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) genotype of a subject which comprises 

(a) isolating nucleic acid from said subject; 

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic 
acid, which includes at least intron 3, thereby obtaining an amplified fragment; and 

(c) sequencing the amplified fragment obtained in step (b), thereby determining the 
cytochrome P450 3A5 (CYP3A5) nucleotide 22,893 genotype of said subject. 

In a further aspect, the present invention provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) intron 3 genotype of a subject which comprises 

(a) isolating nucleic acid from said subject; 

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic 
acid using a set of primers, wherein said set of primers contains primer X and primer 
Y; wherein 

(i) the X primer is complementary to a region 5' to the point mutation site at 
nucleotide 22,893 of Genbank sequence accession no. AC005020; 

(ii) the Y primer is complementary to a region 3' to the point mutation site at 
nucleotide 22,893 of Genbank sequence accession no. AC005020; 

(c) amplifying the sequence in between primers X and Y, thereby obtaining an 
amplified fragment; and 

(d) sequencing the amplified fragment obtained in step (c), thereby determining the 
cytochrome P450 3A5 (CYP3A5) intron 3 genotype of said subject. 
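
The arrangement of primers X and Y on either side of the point mutation site can be 
sketched in silico as follows. The sketch assumes exact primer matches, a forward 
primer X written 5' to 3' as it reads on the top strand of the template, and a 
reverse primer Y written 5' to 3' as synthesized, so that its reverse complement is 
what appears on the top strand. The function names and the zero-based target_index 
parameter are hypothetical, and the sequences used to exercise the function are toy 
examples rather than CYP3A5 sequence. 

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def amplicon_between(template: str, primer_x: str, primer_y: str,
                     target_index: int):
    """Return the fragment bounded by primers X and Y if the target
    position lies between the two primer binding sites, else None."""
    template = template.upper()
    primer_x = primer_x.upper()
    x_start = template.find(primer_x)
    if x_start == -1:
        return None
    x_end = x_start + len(primer_x)
    # Primer Y anneals to the top strand, so search for its reverse
    # complement downstream of primer X.
    y_site = reverse_complement(primer_y)
    y_start = template.find(y_site, x_end)
    if y_start == -1:
        return None
    if not (x_end <= target_index < y_start):
        return None
    return template[x_start:y_start + len(y_site)]
```

A return value of None indicates that, under the exact-match assumption, the chosen 
primer pair would not yield a fragment spanning the position of interest. 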

In a particular example, the present invention extends to a diagnostic assay, wherein 
primer X has the sequence corresponding to SEQ ID NO: 24, or a fragment thereof 
which is at least ten bases long, and primer Y has the sequence corresponding to 
SEQ ID NO: 25, or a fragment thereof which is at least ten bases long. 

In a further example, the present invention extends to a diagnostic assay, wherein 
primer X has the sequence corresponding to SEQ ID NO: 26, or a fragment thereof 
which is at least ten bases long, and primer Y has the sequence corresponding to 
SEQ ID NO: 27, or a fragment thereof which is at least ten bases long. 

The present invention also provides and contemplates a diagnostic assay for 
determining cytochrome P450 3A5 (CYP3A5) intron 3 genotype of a subject which 
comprises 

(a) isolating nucleic acid from said subject; 

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic 
acid which includes at least intron 3, thereby obtaining an amplified fragment; and 

(c) treating the amplified DNA fragment obtained in step (b) with restriction enzyme 
in its corresponding restriction buffer to detect presence or absence of a point 
mutation at nucleotide 22,893 of Genbank sequence accession no. AC005020, 
thereby determining the cytochrome P450 3A5 (CYP3A5) genotype of said subject. 



In a particular embodiment, the present invention includes a diagnostic assay for 
determining cytochrome P450 3A5 (CYP3A5) genotype of a subject which 
comprises 

(a) isolating nucleic acid from said subject; 

(b) making a first and a second PCR primer wherein 

(i) the first PCR primer is complementary to intron 3 and introduces a base 
change in the PCR product adjacent to or near the point mutation at nucleotide 
22,893 of Genbank sequence accession no. AC005020, such that a restriction site is 
generated in the presence of a particular nucleotide at nucleotide 22,893; and 

(ii) the second PCR primer is complementary to a region 3' to the intron 3 
nucleotide 22,893 of Genbank sequence accession no. AC005020; 

(c) amplifying the sequence in between the first and the second primers, thereby 
obtaining an amplified fragment; and 

(d) treating the amplified fragment obtained in step (c) with a restriction enzyme in 
its corresponding restriction buffer to detect presence or absence of a point mutation 
at nucleotide 22,893, thereby determining the cytochrome P450 3A5 (CYP3A5) 
genotype of said subject. 

In a particular embodiment of the above assay, the first primer introduces a Tru9 
I/Mse I restriction site in the presence of an A nucleotide at nucleotide 22,893, and the 
second primer has the sequence selected from SEQ ID NO: 27 and SEQ ID NO: 25, 
or a fragment thereof which is at least ten bases long. In a further particular 
embodiment of the above assay, the first primer has the sequence corresponding to 
SEQ ID NO: 33, or a fragment thereof which is at least ten bases long, and the second 
primer has the sequence corresponding to SEQ ID NO: 27, or a fragment thereof 
which is at least ten bases long. In a still further particular embodiment of the above 
assay, the first primer has the sequence corresponding to SEQ ID NO: 33, or a 
fragment thereof which is at least ten bases long, and the second primer has the sequence 
corresponding to SEQ ID NO: 25, or a fragment thereof which is at least ten bases 
long. 



In a further aspect, the present invention provides a diagnostic assay for determining
cytochrome P450 3A5 (CYP3A5) intron 3 genotype of a subject which comprises

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid using a first set of primers, wherein said first set of primers contains primer X
and primer Y; wherein

(i) the X primer is complementary to a region 5' to the point mutation site at
nucleotide 22,893 of Genbank sequence accession no. AC005020;

(ii) the Y primer is complementary to a region 3' to the point mutation site at
nucleotide 22,893 of Genbank sequence accession no. AC005020;

(c) amplifying the sequence in between primers X and Y, thereby obtaining a first
round amplified fragment;

(d) amplifying the first round amplified fragment using a second set of primers,
wherein said second set of primers contains primer Z and primer W, wherein

(i) primer Z is complementary to intron 3 and introduces a base change in the
PCR product adjacent to or near the point mutation at nucleotide 22,893 of Genbank
sequence accession no. AC005020, such that a restriction site is generated in the
presence of a particular mutation at nucleotide 22,893; and

(ii) primer W is complementary to a region 3' to intron 3;

(e) amplifying the sequence in between primers Z and W, thereby obtaining an
amplified fragment; and

(f) treating the amplified fragment obtained in step (e) with a restriction enzyme in
its corresponding restriction buffer to detect presence or absence of a point mutation
at nucleotide 22,893, thereby determining the cytochrome P450 3A5 (CYP3A5)
genotype of said subject.
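
The two rounds of amplification in such a nested assay can be pictured with a small
in-silico model. The Python sketch below is a toy under stated assumptions: the
template and the primers X, Y, Z and W are invented strings (not the SEQ ID NOs
referred to herein), primers are required to match exactly, and the base change that
primer Z introduces in the actual assay is not modelled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplify(template, forward, reverse):
    """Return the product bounded by the two primers, or None if either fails to anneal."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so search for its
    # reverse complement downstream of the forward primer.
    target = revcomp(reverse)
    end = template.find(target, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(target)]


# Hypothetical template and primers, for illustration only.
TEMPLATE = "CCATGGACTTGCAAGCTTCGATAGGCCTAGTCCGGA"
PRIMER_X, PRIMER_Y = "ATGGAC", "CGGACT"  # outer (first round) pair
PRIMER_Z, PRIMER_W = "GCAAGC", "TAGGCC"  # inner (second round) pair

first_round = amplify(TEMPLATE, PRIMER_X, PRIMER_Y)
second_round = amplify(first_round, PRIMER_Z, PRIMER_W)

if __name__ == "__main__":
    print("first round :", first_round)
    print("second round:", second_round)
```

The second-round product lies entirely within the first-round product, which is the
defining property of a nested amplification.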

In a particular embodiment of the above assay, primer X has the sequence
corresponding to SEQ ID NO: 24, or a fragment thereof which is at least ten bases
long; primer Y has the sequence of SEQ ID NO: 25, or a
fragment thereof which is at least ten bases long; primer Z introduces a Tru9I/MseI
restriction site in the presence of an A nucleotide at nucleotide 22,893; and primer
W has the sequence selected from SEQ ID NO: 27 and SEQ ID NO: 25, or a
fragment thereof which is at least ten bases long. In a further particular embodiment
primer Z has the sequence corresponding to SEQ ID NO: 33, or a fragment thereof
which is at least ten bases long.

In a particular aspect, the invention provides a method for detecting the presence or
activity of cytochrome P450 3A5 (CYP3A5), wherein said cytochrome P450 3A5
(CYP3A5) is measured by:

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid, which includes at least intron 3, thereby obtaining an amplified fragment; and

(c) sequencing the amplified fragment obtained in step (b), thereby determining the
cytochrome P450 3A5 (CYP3A5) intron 3 genotype of said subject;

wherein the detection of a G nucleotide at nucleotide 22,893 of Genbank sequence
accession no. AC005020 indicates the presence or activity of said cytochrome P450
3A5 (CYP3A5) in said sample.
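
By way of a hedged illustration, reading the base at a given Genbank coordinate
from a sequenced amplicon reduces to an offset calculation. In the Python sketch
below, the amplicon start coordinate (22,890) and the two five-base reads are
hypothetical placeholders; only the coordinate 22,893 comes from the text above,
and the interpretation of the resulting genotype is left to the assay as described.

```python
def base_at(read, read_start, ref_position):
    """Return the base of `read` at a reference coordinate.

    `read_start` is the reference coordinate of the first base of the read.
    """
    index = ref_position - read_start
    if not 0 <= index < len(read):
        raise ValueError("reference position lies outside the sequenced fragment")
    return read[index].upper()


def genotype(allele_reads, read_start, ref_position=22893):
    """Summarize the bases seen at one position across the reads of a diploid sample."""
    bases = sorted({base_at(r, read_start, ref_position) for r in allele_reads})
    if not bases:
        raise ValueError("no reads supplied")
    if len(bases) > 2:
        raise ValueError("more than two alleles observed for a diploid sample")
    if len(bases) == 1:
        return bases[0] + "/" + bases[0]  # homozygous
    return "/".join(bases)  # heterozygous


if __name__ == "__main__":
    # Hypothetical reads whose first base corresponds to coordinate 22,890.
    print(genotype(["ACGGT", "ACGAT"], read_start=22890))
    print(genotype(["ACGGT"], read_start=22890))
```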

The present invention further provides a diagnostic assay for determining 
cytochrome P450 3A5 (CYP3A5) genotype of a subject which comprises

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid, which includes at least exon 7, thereby obtaining an amplified fragment; and

(c) sequencing the amplified fragment obtained in step (b), thereby determining the
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of said subject.

In a further embodiment, the invention provides a diagnostic assay for determining
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of a subject which comprises

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid, which includes at least exon 7, thereby obtaining an amplified fragment; and

(c) sequencing the amplified fragment obtained in step (b), thereby determining the
cytochrome P450 3A5 (CYP3A5) nucleotide 30,597 genotype of said subject.

In a further aspect, the present invention provides a diagnostic assay for determining
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of a subject which comprises

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid using a set of primers, wherein said set of primers contains primer X and primer
Y; wherein

(i) the X primer is complementary to a region 5' to the point mutation site at
nucleotide 30,597 of Genbank sequence accession no. AC005020;

(ii) the Y primer is complementary to a region 3' to the point mutation site at
nucleotide 30,597 of Genbank sequence accession no. AC005020;

(c) amplifying the sequence in between primers X and Y, thereby obtaining an
amplified fragment; and

(d) sequencing the amplified fragment obtained in step (c), thereby determining the
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of said subject.

In a particular example, the present invention extends to a diagnostic assay, wherein
primer X has the sequence corresponding to SEQ ID NO: 30, or a fragment thereof
which is at least ten bases long, and primer Y has the sequence corresponding to
SEQ ID NO: 16, or a fragment thereof which is at least ten bases long.

In a further example, the present invention extends to a diagnostic assay, wherein
primer X has the sequence corresponding to SEQ ID NO: 31, or a fragment thereof
which is at least ten bases long, and primer Y has the sequence corresponding to
SEQ ID NO: 32, or a fragment thereof which is at least ten bases long.

The present invention also provides and contemplates a diagnostic assay for 
determining cytochrome P450 3A5 (CYP3A5) genotype of a subject which 
comprises 

(a) isolating nucleic acid from said subject; 

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid, which includes at least exon 7, thereby obtaining an amplified fragment; and

(c) treating the amplified DNA fragment obtained in step (b) with a restriction enzyme
in its corresponding restriction buffer to detect presence or absence of a point
mutation at nucleotide 30,597 of Genbank sequence accession no. AC005020,
thereby determining the cytochrome P450 3A5 (CYP3A5) genotype of said subject.

In a particular embodiment, the present invention includes a diagnostic assay for
determining cytochrome P450 3A5 (CYP3A5) genotype of a subject which
comprises

(a) isolating nucleic acid from said subject;

(b) making a first and a second PCR primer wherein

(i) the first PCR primer is complementary to exon 7 and introduces a base
change in the PCR product adjacent to or near the point mutation at nucleotide
30,597 of Genbank sequence accession no. AC005020, such that a restriction site is
generated in the presence of a particular nucleotide at nucleotide 30,597; and

(ii) the second PCR primer is complementary to a region 3' to the exon 7
nucleotide 30,597 of Genbank sequence accession no. AC005020;

(c) amplifying the sequence in between the first and the second primers, thereby
obtaining an amplified fragment; and

(d) treating the amplified fragment obtained in step (c) with a restriction enzyme in
its corresponding restriction buffer to detect presence or absence of a point mutation
at nucleotide 30,597, thereby determining the cytochrome P450 3A5 (CYP3A5)
genotype of said subject.

In a particular embodiment of the above assay, the first primer introduces a
Tru9I/MseI restriction site in the presence of an A nucleotide at nucleotide 30,597, and
the second primer has the sequence selected from SEQ ID NO: 32 and SEQ ID NO: 16,
or a fragment thereof which is at least ten bases long. In a further particular
embodiment of the above assay, the first primer has the sequence corresponding to
SEQ ID NO: 34, or a fragment thereof which is at least ten bases long, and the second
primer has the sequence corresponding to SEQ ID NO: 32, or a fragment thereof
which is at least ten bases long. In a still further particular embodiment of the above
assay, the first primer has the sequence corresponding to SEQ ID NO: 34, or a
fragment thereof which is at least ten bases long, and the second primer has the sequence
corresponding to SEQ ID NO: 16, or a fragment thereof which is at least ten bases
long.



In a further aspect, the present invention provides a diagnostic assay for determining
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of a subject which comprises

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid using a first set of primers, wherein said first set of primers contains primer X
and primer Y; wherein

(i) the X primer is complementary to a region 5' to the point mutation site at
nucleotide 30,597 of Genbank sequence accession no. AC005020;

(ii) the Y primer is complementary to a region 3' to the point mutation site at
nucleotide 30,597 of Genbank sequence accession no. AC005020;

(c) amplifying the sequence in between primers X and Y, thereby obtaining a first
round amplified fragment;

(d) amplifying the first round amplified fragment using a second set of primers,
wherein said second set of primers contains primer Z and primer W, wherein

(i) primer Z is complementary to exon 7 and introduces a base change in the
PCR product adjacent to or near the point mutation at nucleotide 30,597 of Genbank
sequence accession no. AC005020, such that a restriction site is generated in the
presence of a particular mutation at nucleotide 30,597; and

(ii) primer W is complementary to a region 3' to exon 7;

(e) amplifying the sequence in between primers Z and W, thereby obtaining an
amplified fragment; and

(f) treating the amplified fragment obtained in step (e) with a restriction enzyme in
its corresponding restriction buffer to detect presence or absence of a point mutation
at nucleotide 30,597, thereby determining the cytochrome P450 3A5 (CYP3A5)
genotype of said subject.

In a particular embodiment of the above assay, primer X has the sequence
corresponding to SEQ ID NO: 30, or a fragment thereof which is at least ten bases
long; primer Y has the sequence of SEQ ID NO: 16, or a fragment thereof which is
at least ten bases long; primer Z introduces a Tru9I/MseI restriction site in the
presence of an A nucleotide at nucleotide 30,597; and primer W has the sequence
selected from SEQ ID NO: 32 and SEQ ID NO: 16, or a fragment thereof which is at
least ten bases long. In a further particular embodiment primer Z has the sequence
corresponding to SEQ ID NO: 34, or a fragment thereof which is at least ten bases
long.

In a particular aspect, the invention provides a method for detecting the presence or
activity of cytochrome P450 3A5 (CYP3A5), wherein said cytochrome P450 3A5
(CYP3A5) is measured by:

(a) isolating nucleic acid from said subject;

(b) amplifying a cytochrome P450 3A5 (CYP3A5) PCR fragment from said nucleic
acid, which includes at least exon 7, thereby obtaining an amplified fragment; and

(c) sequencing the amplified fragment obtained in step (b), thereby determining the
cytochrome P450 3A5 (CYP3A5) exon 7 genotype of said subject;

wherein the detection of an A nucleotide at nucleotide 30,597 of Genbank sequence
accession no. AC005020 indicates the presence or activity of said cytochrome P450
3A5 (CYP3A5) in said sample.

In a further embodiment of this invention, commercial test kits suitable for use by a
medical specialist may be prepared to determine cytochrome P450 3A5 (CYP3A5)
genotype, and thereby determine the presence or absence of cytochrome P450 3A5
(CYP3A5) in an individual patient.

Accordingly, a test kit may be prepared for determining the cytochrome P450 3A5
(CYP3A5) genotype of an individual, whereby the sequence of the upstream
regulatory region of cytochrome P450 3A5 (CYP3A5) is determined, and in a
particular embodiment the sequence of intron 3 of cytochrome P450 3A5 (CYP3A5)
is determined. Further, a test kit may be prepared for determining the cytochrome
P450 3A5 (CYP3A5) genotype of an individual, whereby the sequence of the
upstream regulatory region of cytochrome P450 3A5 (CYP3A5) is determined, and
in a particular embodiment the sequence of exon 7 of cytochrome P450 3A5
(CYP3A5) is determined. The test kit may include the PCR amplification of intron 3
and/or exon 7 of cytochrome P450 3A5 (CYP3A5). In one embodiment, intron 3 of
cytochrome P450 3A5 (CYP3A5) is amplified and then directly sequenced. In one
embodiment, exon 7 of cytochrome P450 3A5 (CYP3A5) is amplified and then
directly sequenced. In an additional embodiment, intron 3 and/or exon 7 of
cytochrome P450 3A5 (CYP3A5) is amplified and its characteristic sequence is
determined by assessing susceptibility of the PCR product to cleavage with a
particular restriction enzyme or a set of restriction enzymes. In a further
embodiment, specific primer sets are utilized in amplification of intron 3 and/or exon
7 of cytochrome P450 3A5 (CYP3A5) and the presence or absence of PCR product
with the specific primer set is evaluated in determining the cytochrome P450 3A5
(CYP3A5) genotype.

In a further aspect, the present invention provides a test kit for determining
cytochrome P450 3A5 (CYP3A5) genotype in a eukaryotic cellular sample,
comprising:

(a) a predetermined amount of a first PCR primer complementary to a region 5' to
intron 3;

(b) a predetermined amount of a second PCR primer complementary to a region 3'
to intron 3;

(c) other reagents; and

(d) directions for use of said kit.

In a particular embodiment, the first PCR primer in a test kit of the present invention
has a sequence corresponding to SEQ ID NO: 24 or SEQ ID NO: 26, or a fragment
thereof which is at least ten bases long.

In a further particular embodiment, the second PCR primer in a test kit of the present
invention has a sequence corresponding to SEQ ID NO: [illegible], or a
fragment thereof which is at least ten bases long.

In a further particular aspect, the present invention provides a test kit for determining
cytochrome P450 3A5 (CYP3A5) genotype in a eukaryotic cellular sample,
comprising:

(a) a predetermined amount of a first PCR primer complementary to the nucleotide
22,893 region that introduces a base change in the PCR product adjacent to or near
the point mutation at nucleotide 22,893, such that a restriction site is generated in the
presence of a particular mutation at nucleotide 22,893;

(b) a predetermined amount of a second PCR primer complementary to a region 3'
to intron 3;

(c) other reagents; and

(d) directions for use of said kit.

In a particular embodiment, the first PCR primer in a test kit of the present invention
introduces a Tru9I/MseI restriction site in the presence of an A nucleotide at
nucleotide 22,893 of Genbank sequence accession no. AC005020.

In a further particular embodiment, the first PCR primer in a test kit of the present
invention has a sequence corresponding to SEQ ID NO: 33, or a fragment thereof
which is at least ten bases long. In a still further particular embodiment, the second
PCR primer in a test kit of the present invention has a sequence corresponding to
SEQ ID NO: 27, or a fragment thereof which is at least ten bases long.

In a further aspect, the present invention provides a test kit for determining
cytochrome P450 3A5 (CYP3A5) genotype in a eukaryotic cellular sample,
comprising:

(a) a predetermined amount of a first PCR primer complementary to a region 5' to
exon 7;

(b) a predetermined amount of a second PCR primer complementary to a region 3'
to exon 7;

(c) other reagents; and

(d) directions for use of said kit.

In a particular embodiment, the first PCR primer in a test kit of the present invention
has a sequence corresponding to SEQ ID NO: 30 or SEQ ID NO: 31, or a fragment
thereof which is at least ten bases long.

In a further particular embodiment, the second PCR primer in a test kit of the present
invention has a sequence corresponding to SEQ ID NO: 16 or SEQ ID NO: 32, or a
fragment thereof which is at least ten bases long.

In a further particular aspect, the present invention provides a test kit for determining
cytochrome P450 3A5 (CYP3A5) genotype in a eukaryotic cellular sample,
comprising:

(a) a predetermined amount of a first PCR primer complementary to nucleotide
30,597 region of Genbank sequence accession no. AC005020 that introduces a base
change in the PCR product adjacent to or near the point mutation at nucleotide
30,597, such that a restriction site is generated in the presence of a particular
mutation at nucleotide 30,597;

(b) a predetermined amount of a second PCR primer complementary to a region 3'
to exon 7;

(c) other reagents; and

(d) directions for use of said kit.

In a particular embodiment, the first PCR primer in a test kit of the present invention
introduces a Tru9I/MseI restriction site in the presence of an A nucleotide at
nucleotide 30,597 of Genbank sequence accession no. AC005020.

In a further particular embodiment, the first PCR primer in a test kit of the present
invention has a sequence corresponding to SEQ ID NO: 34, or a fragment thereof
which is at least ten bases long. In a still further particular embodiment, the second
PCR primer in a test kit of the present invention has a sequence corresponding to
SEQ ID NO: 32, or a fragment thereof which is at least ten bases long.

In a further aspect, the present invention provides oligonucleotide primers or probes
suitable for use in the determination of cytochrome P450 3A5 (CYP3A5) genotype,
particularly for determination of cytochrome P450 3A5 (CYP3A5) intron 3
genotype. In a further aspect, the present invention provides oligonucleotide primers
or probes suitable for use in the determination of cytochrome P450 3A5 (CYP3A5)
genotype, particularly for determination of cytochrome P450 3A5 (CYP3A5) exon 7
genotype.

In a particular embodiment, the present invention relates to an isolated
oligonucleotide primer having a sequence selected from the group of SEQ ID NOS:
24-27 and 33 or a fragment thereof which is at least ten bases long.

In a further embodiment, the present invention relates to an isolated oligonucleotide
primer having a sequence selected from the group of SEQ ID NOS: 16, 30-32 and
34 or a fragment thereof which is at least ten bases long.

The possibilities, both diagnostic and therapeutic, that are raised by the interindividual
variation in expression of cytochrome P450 3A5 (CYP3A5) derive from the fact
that cytochrome P450 3A5 (CYP3A5) contributes to and is responsible for
metabolism of drugs, compounds, chemicals or other agents, particularly those
utilized in treatment and prevention of particular diseases and conditions, including
cancer treatment and immunosuppressant protocols. As suggested earlier and
elaborated further on herein, the present invention contemplates determination of the
cytochrome P450 3A5 (CYP3A5) genotype, and thereby cytochrome P450 3A5
(CYP3A5) protein expression of an individual. The knowledge of the cytochrome
P450 3A5 (CYP3A5) genotype and expression of an individual is useful in
determining drug efficacy, drug susceptibility and drug toxicity on an individual
patient basis.

The diagnostic and therapeutic utility of the present invention extends to the use of
the assays, methods and kits of the present invention in determining and evaluating
the expression of cytochrome P450 3A5 (CYP3A5). The expression of cytochrome
P450 3A5 (CYP3A5) in an individual is useful in determining and evaluating an
individual's ability to metabolize or tolerate certain drugs, agents,
chemicals or compounds. The expression of cytochrome P450 3A5 (CYP3A5) in an
individual is particularly relevant in determining drug sensitivity, efficacy and
toxicity in patients undergoing, for instance, cancer or immunosuppressant treatment.

Thus, in instances where it is desired to reduce or inhibit the metabolism of a drug,
compound or agent, the genotype and expression of cytochrome P450 3A5
(CYP3A5) is determined, and the suitability of therapeutic use and dose of any such
drug, compound or agent is thereby determined. Conversely, in instances wherein it
is desirable to activate or enhance the metabolism of a drug, compound or agent, the
genotype and expression of cytochrome P450 3A5 (CYP3A5) is determined, and the
suitability of therapeutic use and dose of any such drug, compound or agent is
thereby determined.

As used herein, "pg" means picogram, "ng" means nanogram, "ug" or "µg" mean
microgram, "mg" means milligram, "ul" or "µl" mean microliter, "ml" means
milliliter, "l" means liter.

The invention may be better understood by reference to the following non-limiting 
Examples, which are provided as exemplary of the invention. The following 
examples are presented in order to more fully illustrate the preferred embodiments of
the invention and should in no way be construed, however, as limiting the broad 
scope of the invention. 

EXAMPLE 1

Variable and polymorphic expression of CYP3A5 may be a determinant of
diverse pathological processes.

The ubiquitous expression of CYP3A5 (but not CYP3A4) in human kidney, lung,
brain and polymorphonuclear leukocytes (PMNs) suggests that CYP3A5 has an
important physiological function in these tissues. Indeed, CYP3A5 can mediate the
metabolism of cortisol to 6β-hydroxycortisol, a physiological regulator of Na+
transport in renal epithelia (Grogan, W.M. et al. (1990) Am J Physiol 258: C480-488).
Thus, CYP3A5 is involved not only in a detoxification pathway, but, in organs
such as the kidney, may also serve an important function in regulating the pool of an
endogenous physiological regulator. Hence, the extensive inter-individual variation
in CYP3A5 expression in human kidney may contribute to individual differences in
the localized generation of 6β-hydroxycortisol within the nephron (Schuetz, E.G. et
al. (1992) Arch Biochem Biophys 294: 206-214; Haehner, B.D. et al. (1995) ISSX
Proceedings 8: 352 (Abstr)). Thus, it is possible that cortisol 6β-hydroxylation
could play an etiologic role, by increasing the tendency to renal Na+ retention, in a
subset of essential hypertensives, e.g., salt-sensitive hypertension.

Variable and polymorphic expression of CYP3A5 may be a risk factor for 
susceptibility to environmentally related diseases. 

The CYP3A5 polymorphism is a locus for variable interaction between individuals 
and the environment. There are examples of CYP3A5 catalytic selectivity. For 
instance, the rate of 3α-hydroxylation of aflatoxin B1 (inactivation step) by CYP3A5
is only approximately 1% that of CYP3A4, while CYP3A5 efficiently catalyzes the
8,9-epoxidation step (activation step) of aflatoxin B1 metabolism (Gillam, E.M. et
al. (1995) Arch Biochem Biophys 317: 374-384). It has been speculated that in
humans for whom CYP3A5 is the dominant hepatic CYP3A expressed, the lack of
aflatoxin B1 detoxification by 3A4 but good activation by CYP3A5 might enhance
aflatoxin's capability for genotoxicity (Gillam, E.M. et al. (1995) Arch Biochem
Biophys 317: 374-384). Thus, phenotypic differences in CYP3A5 metabolism of 
environmental chemicals may be a risk factor for susceptibility to xenobiotic 
toxicities. 



Variable and polymorphic expression of CYP3A5 contributes to variation in the
disposition of CYP3A substrates.

CYP3A5 exhibits catalytic activities that are qualitatively similar to CYP3A4 for
most drugs (Gillam, E.M. et al. (1995) Arch Biochem Biophys 317: 374-384).
Therefore, individual variation in CYP3A5 contributes to the overall fate of these
CYP3A substrates and thus, may be an important determinant of their
pharmacokinetic and pharmacodynamic variability.

CYP3A5 is polymorphically expressed in human liver with 20-25% of Caucasians
expressing this protein. Approximately 75% of Caucasians do not express CYP3A5
in liver. CYP3A5 is polymorphically expressed in kidney with 25% of individuals
expressing higher amounts of 3A5 and 75% expressing just detectable amounts of
3A5 protein. CYP3A5 is the major cytochrome P450 form in tissues outside the
liver and intestine.

In summary, inasmuch as CYP3A5 is the predominant CYP3A expressed in kidney,
brain, lung and PMNs, and in the liver and intestine of some individuals, these
results provide the basis for understanding any molecular defects in the CYP3A5
gene sequences that may be responsible for its polymorphic expression in liver.
Ultimately, identification of the molecular basis for this polymorphism will allow
development of simple DNA based tests to determine the association of CYP3A5
polymorphism to general health, individual differences in drug disposition or as a
risk factor for developing certain diseases.

Cytochrome P450 3A4 has been considered a significant enzyme in the metabolism of
midazolam. Using heterologously expressed cytochrome P450 enzymes, Wandel et
al. confirmed the additional involvement of CYP450 3A3 and 3A5 in midazolam
hydroxylation (Wandel, C. et al. (1994) Br J Anaesth 73(5): 658-61). CYP450 3A5
expression was shown to increase metabolism of midazolam by a factor of 2.7 and
showed far greater alpha- to 4-hydroxylation of midazolam versus CYP450 3A3 or
CYP450 3A4. Interestingly, nitrendipine, which is also metabolized
by CYP450 3A enzymes, inhibits liver microsomal hydroxylation of midazolam
(Wandel, C. et al. (1994) Br J Anaesth 73(5): 658-61). Thus, CYP450 3A enzyme
expression may be relevant to drug interactions in combined therapy. Moreover,
differential inhibition of CYP3A5 versus CYP3A4 may be more likely to lead to
drug interactions in those expressing more CYP3A5.



Extrahepatic expression of CYP3A5 may protect an individual from
epipodophyllotoxin toxicity and the risk of secondary AML by altering overall
epipodophyllotoxin metabolism.

CYP3A metabolizes epipodophyllotoxin (Relling, M.V. et al. (1994) Molec.
Pharmacol. 45:352-358). A paper associating the NFSE-V
allele with risk of epipodophyllotoxin-related leukemia (Felix, C.A. et al. (1998) Proc.
Natl. Acad. Sci. 95:13176-13181) speculated that the metabolism of EPI by
CYP3A4 would lead to formation of the "reactive catechol metabolite" that might
potentially damage DNA. However, our data would now suggest that the NFSE-V
individuals (CYP3A5 expressors) who did not develop secondary AML were protected from
leukemogenesis by enhanced metabolism of the parent drug, both hepatically and in
extrahepatic tissues.

Variable and polymorphic expression of CYP3A5 may influence the 
metabolism of estrogens and androgens in gonadal tissues. 

Moreover, because CYP3A metabolizes estrogens to 2-hydroxyestrone,
4-hydroxyestrone, and 16alpha-hydroxylated estrogens, all of which have been
implicated in estrogen-mediated carcinogenicity (Huang, Z. et al. (1998) Carcinogenesis 19:
867-872), variation in CYP3A may influence the circulating levels of these
estrogens and the risk of breast cancer. In total, because CYP3A also metabolizes
testosterone and estrogen, its expression in gonadal tissues may contribute to
variable metabolism of steroids in these tissues and, hence, to differences in the
concentrations of circulating steroid and risk of disease in these tissues.



CYP3A is expressed in prostate (Murray, G.I. et al. (1995) J. of Pathology
177:147-152). This analysis was by immunohistochemistry and would not
have differentiated between CYP3A4 and CYP3A5. CYP3A5 alters metabolism
of testosterone. CYP3A5 can metabolize testosterone (Gillam, E.M.J. et al. (1995)
Arch. Biochem. Biophys. 317:374-384).



EXAMPLE 2 






SEQUENCE DIVERSITY IN CYP3A PROMOTERS AND 
CHARACTERIZATION OF THE GENETIC BASIS FOR POLYMORPHIC 

CYP3A5 EXPRESSION 



ABSTRACT 



Variation in the CYP3A enzymes, which act in drug metabolism, influences levels
of circulating steroids and therapeutic responses to half of all oxidatively
metabolized drugs. CYP3A activity is the sum activity of the family of CYP3A
genes, including CYP3A5, which is polymorphically expressed at high levels in a
minority of Americans of European descent and Europeans (hereinafter referred to as
'Caucasians'). Only people with at least one CYP3A5*1 allele express large amounts
of CYP3A5. Our findings show that single-nucleotide polymorphisms (SNPs) in
CYP3A5*3 and CYP3A5*6 that cause alternative splicing and protein truncation
result in the absence of CYP3A5 from tissues of some people. CYP3A5 was more
frequently expressed in livers of African Americans (60%) than in those of
Caucasians (33%). Because CYP3A5 represents at least 50% of the total hepatic
CYP3A content in people polymorphically expressing CYP3A5, CYP3A5 may be
the most important genetic contributor to interindividual and interracial differences
in CYP3A-dependent drug clearance and in responses to many medicines.



INTRODUCTION 
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Understanding the biological and medical implications of human genome sequence 
variation is a major goal of the Human Genome Project. Cytochromes P450 are
particularly amenable for rapid identification of functionally important sequence
variations, because cytochrome P450 (CYP) catalytic activities can be determined by
various in vivo and in vitro drug bioassays. Moreover, the applications of such
information are particularly relevant for pharmacogenomics, in which knowledge of
SNPs in CYP genes may ultimately lead to individualized drug dosing and improved
therapeutics. In this report, we attempted to elucidate whether sequence variations in
the promoters of genes encoding enzymes of the CYP3A family contribute to
variability in CYP3A metabolism.

The cytochromes P450 evolved to catalyze the metabolism of numerous structurally
diverse exogenous and endogenous molecules. Approximately 55 different CYP
genes are present in the human genome and are classified into different families and
subfamilies on the basis of sequence homology. The CYP families have arisen
through a process of gene duplication and gene conversion. Members of the CYP3A
subfamily catalyze the oxidative, peroxidative, and reductive metabolism of
numerous, structurally diverse endobiotics, drugs, and protoxic or procarcinogenic
molecules. The CYP3A members are the most abundant CYPs in human liver and
small intestine. Significant interindividual differences in CYP3A expression,
exceeding 30-fold in some populations, contribute greatly to variation in oral
bioavailability and systemic clearance of CYP3A substrates, including HIV protease
inhibitors, several calcium channel blockers, and some cholesterol-lowering drugs.
Variation in CYP3A expression is particularly important for substrates with narrow
therapeutic indices, such as cancer chemotherapeutics and the immunosuppressants
cyclosporin A and tacrolimus (FK506). Such variation in CYP3A can result in
clinically significant differences in drug toxicities (e.g., nephrotoxicity) and response
(e.g., graft survival). Moreover, because CYP3A metabolizes estrogens to
2-hydroxyestrone, 4-hydroxyestrone, and 16α-hydroxylated estrogens, all of which
have been implicated in estrogen-mediated carcinogenicity, variation in CYP3A
may influence the circulating levels of these estrogens and the risk of breast cancer.

Human CYP3A activities reflect the heterogeneous expression of at least three
CYP3A family members: CYP3A4, CYP3A5, and CYP3A7. The CYP3A genes are
adjacent to each other on chromosome band 7q21, but the genes are differentially
regulated. Two CYP3A pseudogenes, CYP3AP1 and CYP3AP2, were recently
identified. Functional CYP3A4 is found in most adults, with 10- to 40-fold
variation in its expression. CYP3A7 is predominantly expressed in fetal life, and its
expression appears to be silenced shortly after birth. Inexplicably, some persons
express CYP3A7 mRNA into adulthood. CYP3A4 activity is correlated with
CYP3A4 mRNA concentrations, indicating that transcriptional control is the primary
mechanism for regulating expression of the CYP3A4 gene. CYP3A5 was previously
detected in a minority of adult Caucasian livers and small intestines; the basis for
this "polymorphic" expression is unknown. Recently, two mutations in what was
believed to be the CYP3A5 promoter were found to be associated with polymorphic
CYP3A5 expression. However, sequencing of the CYP3A locus revealed that these
SNPs are actually in the promoter of the pseudogene CYP3AP1 (ref 10) and, thus,
cannot be the basis for polymorphic CYP3A5 expression. The CYP3A4, CYP3A5,
and CYP3A7 promoters contain multiple putative transcription factor binding sites.
In particular, a recently discovered nuclear receptor, PXR/SXR (pregnane X
receptor/steroid and xenobiotic receptor), that binds to a PXR response element in the
CYP3A4 and CYP3A7 promoters is important for regulation of the CYP3A
proteins. Therefore, we hypothesized that polymorphisms affecting CYP3A
activity are present in DNA regulatory sequences. Through a systematic analysis of
genomic DNA from a large number of patients with various drug metabolism
phenotypes, and through analysis of DNA from the Coriell DNA Polymorphism
Discovery Resource (DPDR, ref 1), we identified all common variants in the
promoter regions of the CYP3A4, CYP3A5, CYP3AP1 and CYP3A7 genes and
demonstrated ethnic differences in allele frequencies. By performing quantitative
immunoblotting and in vitro kinetic assays of drug metabolism, we evaluated the
relationship between these common genetic variants and ethnic- and tissue-specific
phenotypic variability. In addition, analysis of human liver CYP3A5 cDNA
revealed that only those persons with a CYP3A5*1 allele produce high levels of
full-length CYP3A5 mRNA and express CYP3A5. Those with the CYP3A5*3 allele have
sequence variability in intron 3 that creates a cryptic splice site and results in the
generation of CYP3A5 exon 3B; this CYP3A5*3 allele codes for an aberrantly
spliced mRNA with a premature stop codon. This finding explains the molecular
defect responsible for one of the most common polymorphisms in drug-metabolizing
enzymes.

RESULTS 

We initially performed SNP analysis of the promoter regions of the CYP3A genes
on DNA from the smallest DPDR subset (TABLE 3). The minimum DPDR panel
contained five variant alleles with a frequency ≥ 1% in the combined study
population. These variant alleles included the CYP3A4*1B allele, in which an A→G
variant at nt -288 is present in the nifedipine-specific element of the promoter. We
detected the CYP3AP1*1 reference allele (G at nt -44 in the promoter) in 43 persons
(28 heterozygotes and 15 homozygotes). In an unusual haplotype, hereafter referred
to as CYP3A7*1C (TABLE 3), an approximately 60-bp stretch (nt -129 to -188) of
the CYP3A7 promoter seems to have been replaced with sequence identical to the
same region in the CYP3A4 promoter; this replacement suggests the occurrence of a
gene conversion event.

The Coriell DPDR panel was designed to represent the genetic diversity of United
States residents who have ancestors from Europe, Africa, the Americas, and Asia.
We found no variants in the DPDR panel that were not also seen in the combined
study population, but additional CYP3A alleles found in our larger combined
population were not present in the DPDR. Most of these alleles were rare, i.e., they
were seen in only one or two persons. To assess the usefulness of the DPDR panel in
predicting SNPs, we calculated the probability that common variants would be
missed in a random sampling of our combined population. The chance of missing the
polymorphic alleles CYP3A4*1B and CYP3AP1*1 was less than 5%, even if the
sample consisted of as few as 10 study subjects from the combined population
group. If we were to randomly select a sample of 25 subjects, the probability of
missing polymorphic CYP3A7*1C alleles would be 15%, that of missing
CYP3A7*1E would be 25%, and that of missing CYP3A7*1D would be greater than
60%. The probability of missing the rare alleles CYP3A4*1C, CYP3A4*1D,
CYP3AP1*1B and CYP3AP1*1C would be greater than 60%. Thus the DPDR
eight-sample subset was more useful in accurately predicting the identity of CYP3A SNPs
than a random sampling of 25 persons from the combined population.
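
The dependence of these probabilities on allele frequency and sample size can be illustrated with a short calculation. The sketch below is not the procedure used in the study, which the text does not describe. It assumes that each of the 2n chromosomes in a random sample of n persons independently carries an allele of frequency q, so that the chance of missing the allele entirely is (1 - q)^(2n). The function name and the example frequencies are hypothetical.

```python
def prob_allele_missed(q: float, n_persons: int) -> float:
    """Probability that an allele of frequency q is absent from a random
    sample of n_persons diploid individuals (2 * n_persons chromosomes).

    Assumption (not stated in the text): chromosomes are sampled
    independently, so each one lacks the allele with probability 1 - q.
    """
    if not 0.0 <= q <= 1.0:
        raise ValueError("allele frequency must be between 0 and 1")
    if n_persons < 0:
        raise ValueError("sample size must be non-negative")
    return (1.0 - q) ** (2 * n_persons)


if __name__ == "__main__":
    # Hypothetical allele frequencies, chosen only for illustration.
    for q in (0.01, 0.04, 0.15):
        print(f"q={q:.2f}: n=10 -> {prob_allele_missed(q, 10):.3f}, "
              f"n=25 -> {prob_allele_missed(q, 25):.3f}")
```

Under this simple model, an allele with q = 0.15 is missed by a sample of 10 persons less than 5% of the time, whereas an allele with q = 0.01 is missed by a sample of 25 persons about 60% of the time.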

As has been previously reported, CYP3A4*1B occurred at a much higher frequency
in African Americans (q=0.35) than in Caucasians (q=0.02; chi-squared = 48.9,
P<0.001). We found no SNPs in the regions containing the proximal (nt -169 to
-152) and distal (nt -7836 to -7208) (ref 21) PXREs of CYP3A4. Sequence
analysis of the CYP3A5 promoter (nt +90 to -280) revealed two SNPs in the
5'-UTR (TABLE 3). CYP3A7*1C was three times more common in an African
American population (q=0.06) than in a Caucasian population (q=0.03). However,
among the 10 unrelated French Caucasians in the study, three persons were
heterozygous for CYP3A7*1C (q=0.15), whereas only two of the 96 unrelated
non-French Caucasians carried one CYP3A7*1C allele (q=0.01). CYP3AP1*1 was the
most common minor allele in Caucasians and African Americans. This allele is also
divergently distributed among African Americans (q=0.50) and Caucasians (q =
0.08; chi-squared = 51.1, P<0.001). By examining other panels of DNA, we found
that the frequency of CYP3AP1*1 alleles (q values) in Japanese was 0.15; in
Chinese, 0.3; in Mexicans, 0.13; in Southeast Asians (excluding Japanese and
Chinese), 0.5; in Pacific Islanders, 0.3; and in Southwestern American Indians, 0.5.
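
A comparison of allele frequencies between two populations, such as the chi-squared comparisons reported above, can be sketched as a test on a 2 x 2 table of chromosome counts. The code below is a minimal illustration only: the text does not give the chromosome counts behind the reported statistics or say whether a continuity correction was applied, so the cohort sizes in the example are hypothetical and the statistic is the uncorrected Pearson chi-squared.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-squared statistic (1 degree of freedom, no continuity
    correction) for the 2 x 2 table

                       variant allele   reference allele
        population 1         a                 b
        population 2         c                 d
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column must contain observations")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


def allele_counts(q: float, n_persons: int) -> tuple[int, int]:
    """Convert an allele frequency into (variant, reference) chromosome
    counts for n_persons diploid individuals."""
    total = 2 * n_persons
    variant = round(q * total)
    return variant, total - variant


if __name__ == "__main__":
    # Hypothetical cohort sizes; only the q values are taken from the text.
    a, b = allele_counts(0.35, 50)   # population 1
    c, d = allele_counts(0.02, 50)   # population 2
    print(f"chi-squared = {chi_square_2x2(a, b, c, d):.1f}")
```

Because the statistic scales with the number of chromosomes sampled, the value printed for hypothetical cohort sizes is not expected to reproduce the figures reported above.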
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We determined the association of CYP3A alleles with CYP3A phenotypes. None of 
the SNPs in the CYP3A5 or CYP3A4 promoters, including the CYP3A4*1B allele,
were associated with altered CYP3A activities of persons whose phenotypes had been
determined in vivo by the erythromycin breath test or nifedipine clearance or in vitro
by assays for total midazolam hydroxylase or for basal or rifampin-inducible
testosterone 6β-hydroxylase activity. Previous studies have noted the polymorphic
expression of CYP3A5 in the liver and intestine of approximately 25% of persons
studied. Although there is good concordance of CYP3AP1*1 (G at nt -44) with
high levels of CYP3A5 in the liver, CYP3AP1*1 cannot be responsible for
polymorphic CYP3A5 expression. Because we did not find a CYP3A5 promoter
variant that correlated with CYP3A5 expression, we determined whether CYP3AP1*1
was in linkage disequilibrium with another variant in the CYP3A5 gene that might be
causal. Because the CYP3AP1 genotype is correlated with CYP3A5 expression, we
first used mRNAs from Caucasian and African American livers representing the
CYP3AP1*1/*1, CYP3AP1*1/*3 and CYP3AP1*3/*3 genotypes and created and
sequenced the entire CYP3A5 cDNA. The CYP3A5 mRNA content was significantly
greater in persons with a CYP3AP1*1 allele than in those who were homozygous for
CYP3AP1*3. All livers contained the full-length CYP3A5 mRNA transcript encoded
by CYP3A5 (FIGURE 1). Many people carrying CYP3AP1*3 also carried
CYP3A5*3, which encodes only a small amount of normally spliced CYP3A5 mRNA
and a splice variant-1 mRNA (SV1). SV1 results from a SNP at nt 22893 (in
AC005020) that creates a cryptic consensus splice site and exon 3B, an exon derived
from intron-3 sequences of the CYP3A5 gene (FIGURE 1, TABLE 4). The mutated
sequence in intron 3 of CYP3A5*3 has a splice site prediction score of 0.9. Because
of the premature termination codon in exon 3B, the SV1 mRNA encodes a protein
whose sequence is truncated after amino acid 102. We found a second splice variant
(SV2) mRNA that encoded exons 3B and 4B in many persons carrying CYP3A5*3.
Although sequence analysis around the predicted splice site for exon 4B revealed a
SNP at nt 24035 (TABLE 4), this SNP and the generation of SV2 were not related
(TABLE 4). A third splice variant (SV3) mRNA comigrated with SV2 in an agarose
gel and contained exons 3B, 4B, and 5B but not exon 6. No SNPs were found around
exon 5B or exon 6 that correlated with the SV3 transcript. The fact that mRNAs with
premature stop codons are more unstable and rapidly degraded may explain the
lower amount of CYP3A5 mRNA in persons homozygous for CYP3A5*3 in
comparison to persons carrying at least one CYP3A5*1 allele. We examined the
frequency of the CYP3A5*1 allele and found it was divergently distributed among
African Americans (q=0.45) and Caucasians (q = 0.15). By examining other panels of
DNA, we found that the frequency of CYP3A5*1 alleles (q values) in the DPDR
subset of 8 people was 0.31; in Japanese, 0.15; in Chinese, 0.35; in Mexicans,
0.25; in Southeast Asians (excluding Japanese and Chinese), 0.33; in Pacific
Islanders, 0.35; and in Southwestern American Indians, 0.6. CYP3A5*6 was
identified in 3/20 African Americans and encoded a normal CYP3A5 mRNA and a
splice variant mRNA in which exon 7 was deleted. This deletion resulted in a frame
shift; thus the CYP3A5*6 mRNA encoded a protein that was truncated at amino acid
184. Sequence analysis of genomic DNA and cDNA synthesized from the spliced
variant mRNA from those individuals skipping exon 7 (including sequencing of
exons 6, 7 and 8 and 220 and 60 bp of flanking introns 6 and 7, respectively)
identified a single sequence variation: a 30597G>A synonymous mutation in exon 7
(nt 711 in the CYP3A5 cDNA). This genotype was concordant with the loss of exon
7. This silent mutation may cause exon 7 skipping by disrupting an exonic splicing
enhancer (Maquat, L.E. (2001) Nature Genet 27,5).
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To determine the association of the CYP3A5 and CYP3AP alleles with CYP3A5 
expression, we measured CYP3A5 protein concentration in liver specimens. We found
a greater frequency of CYP3A5 (> 21 pmol/mg protein) in African Americans (11/20;
50%) compared with Caucasians (9/27; 33%). All Caucasians and most African
Americans with a CYP3A5 content less than 21 pmol/mg protein were homozygous
for the CYP3A5*3 allele with G at nucleotide 22,893 leading to cryptic CYP3A5
mRNA splicing, whereas those with CYP3A5 content ranging from 21 to 202
pmol/mg had at least one CYP3A5*1 allele (A at nucleotide 22,893 with no aberrant
CYP3A5 mRNA splicing) (FIGURE 2A, 2B). Among the livers of Caucasians, the
CYP3A5 allele frequencies conformed to Hardy-Weinberg equilibrium. There was also
complete concordance between the CYP3A5*1 and CYP3AP1*1 genotypes and
between CYP3A5*3 and CYP3AP1*3 genotypes in Caucasians. Among African
Americans, however, there were several outliers when the CYP3AP1 genotype was
used to predict African American CYP3A5 phenotype. Specifically, African
American liver 788 had a relatively high level of CYP3A5 (54 pmol/mg protein) that
was not predicted by a CYP3AP1*3/*3 genotype, but was explained by the
CYP3A5*1/*3 genotype. African American livers 624 and 958.29 had relatively low
levels of CYP3A5 protein that were not predicted by a CYP3AP1 genotype, but
were better explained by CYP3A5*1/*6 genotypes because the CYP3A5*6 allele
generates a mRNA lacking exon 7. The final outlier (African American liver 50) had
only a trace amount of CYP3A5 protein that was not predicted by a CYP3AP1*1/*3
genotype or the CYP3A5*1/*3 genotype. This discrepancy may be due to secondary
degradation of CYP3A5 and other proteins following poor tissue preservation, or to a
second mutation in the coding region of CYP3A5 that confers instability of the protein
(e.g., CYP3A5*2; ref. 24).
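
Conformity of genotype counts to Hardy-Weinberg equilibrium, as noted above for the livers of Caucasians, can be checked by comparing observed genotype counts with the counts expected from the allele frequency. The following is a minimal sketch under the usual two-allele model (for example, A = CYP3A5*1 and a = CYP3A5*3). The individual genotype counts are not listed in the text, so the counts in the example are hypothetical, and the test shown is a plain chi-squared goodness-of-fit test that may differ from the one the authors used.

```python
def hwe_expected(n_persons: int, p: float) -> tuple[float, float, float]:
    """Expected genotype counts (AA, Aa, aa) under Hardy-Weinberg
    equilibrium for an allele A of frequency p."""
    q = 1.0 - p
    return (n_persons * p * p, n_persons * 2 * p * q, n_persons * q * q)


def hwe_chi_square(n_AA: int, n_Aa: int, n_aa: int) -> float:
    """Chi-squared goodness-of-fit statistic comparing observed genotype
    counts with Hardy-Weinberg expectations (1 degree of freedom)."""
    n = n_AA + n_Aa + n_aa
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    # Allele frequency estimated from the observed genotypes.
    p = (2 * n_AA + n_Aa) / (2 * n)
    observed = (n_AA, n_Aa, n_aa)
    expected = hwe_expected(n, p)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


if __name__ == "__main__":
    # Hypothetical genotype counts for 27 livers (*1/*1, *1/*3, *3/*3).
    print(f"chi-squared = {hwe_chi_square(1, 8, 18):.2f}")
```

A statistic below 3.84 (the 5% critical value for 1 degree of freedom) is consistent with equilibrium, although with small expected counts an exact test is generally preferable.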

We examined the rates of midazolam hydroxylation by the same 47 livers. CYP3A4
and CYP3A5 each produce two metabolites, 1'-hydroxymidazolam (1'-OH MDZ) and
4-hydroxymidazolam (4-OH MDZ). The mean reaction velocities were 2.5-fold (P =
0.03) higher for the livers of Caucasians and 2.2-fold (P = 0.19) higher for the livers of
African Americans with at least one CYP3A5*1 allele than for those livers of persons
who were homozygous for CYP3A5*3. The ratio of 1'-OH MDZ to 4-OH MDZ is
dependent on substrate concentration and the type of CYP3A enzyme. At a
midazolam concentration of 8 µM, the ratio was 5.5 when we used recombinant
CYP3A4 and 16.1 when we used recombinant CYP3A5. Thus, with liver microsomes,
the observed product ratio depends on the relative amounts of CYP3A4 and CYP3A5
protein. The average ratio of 1'-OH MDZ to 4-OH MDZ was higher in Caucasian
livers with at least one CYP3A5*1 allele (8.1 ± 3.0 for *1/*3 genotype) than in those
with two CYP3A5*3 alleles (5.9 ± 1.7 for *3/*3; P = 0.09). The average ratio of
1'-OH MDZ to 4-OH MDZ was higher in African American livers with at least one
CYP3A5*1 allele (9.8 ± 3.8 for *1/*1,*3,*4 genotypes) than in those with two
CYP3A5*3 alleles (8.3 ± 1.6 for *3/*3; P = 0.41).
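
The way the observed product ratio depends on the relative contributions of the two enzymes can be made explicit. If only CYP3A4 and CYP3A5 form the two metabolites, and each enzyme keeps the ratio measured for its recombinant form (5.5 and 16.1 at 8 µM midazolam), then the ratio observed with microsomes is the average of the two ratios weighted by each enzyme's share of 4-OH MDZ formation. The sketch below illustrates this relationship. Both assumptions are simplifications introduced here, not claims made in the text, and the function names are hypothetical.

```python
RATIO_CYP3A4 = 5.5   # 1'-OH/4-OH MDZ ratio, recombinant CYP3A4 (from the text)
RATIO_CYP3A5 = 16.1  # 1'-OH/4-OH MDZ ratio, recombinant CYP3A5 (from the text)


def mixed_product_ratio(share_3a5: float) -> float:
    """Ratio of 1'-OH MDZ to 4-OH MDZ expected when CYP3A5 forms the given
    share (0 to 1) of total 4-OH MDZ and CYP3A4 forms the remainder."""
    if not 0.0 <= share_3a5 <= 1.0:
        raise ValueError("share must be between 0 and 1")
    return (1.0 - share_3a5) * RATIO_CYP3A4 + share_3a5 * RATIO_CYP3A5


def cyp3a5_share_from_ratio(observed_ratio: float) -> float:
    """Inverse relation: share of 4-OH MDZ formation attributable to CYP3A5
    implied by an observed product ratio (assumptions as above)."""
    return (observed_ratio - RATIO_CYP3A4) / (RATIO_CYP3A5 - RATIO_CYP3A4)


if __name__ == "__main__":
    for share in (0.0, 0.25, 0.5, 1.0):
        print(f"CYP3A5 share {share:.2f} -> ratio {mixed_product_ratio(share):.1f}")
```

The weighted-average form follows from writing the observed ratio as (r4·V4 + r5·V5)/(V4 + V5), where V4 and V5 are the rates of 4-OH MDZ formation by CYP3A4 and CYP3A5 and r4 and r5 are the enzyme-specific ratios.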

Measurement of CYP3A quantity in the same 47 livers showed that CYP3A4 content
was not significantly different between Caucasian and African American livers.
Within each ethnicity there was no difference in mean CYP3A4 content between livers
with or without at least one CYP3A5*1 allele. The amount of total CYP3A in
Caucasian and African Americans was approximately 3-fold higher in persons with at
least one CYP3A5*1 allele than that in persons not carrying this allele (P = 0.001 and
P = 0.01, respectively). Moreover, in one-third of Caucasian livers and over one-half
of African American livers, CYP3A5 protein represented more than 50% of the total
CYP3A content. Thus, CYP3A5 represented a considerably greater proportion of total
CYP3A than previously estimated and contributed significantly to CYP3A content and
catalytic activity.

Because CYP3A5 is variably expressed in human intestine, we took advantage of a
unique cohort of paired samples of small intestine and liver to determine whether the
CYP3AP1*1 genotype-phenotype concordance is seen in extrahepatic tissues. All
persons with large quantities of CYP3A5 in the liver also had relatively high intestinal
levels of CYP3A5 and were heterozygous or homozygous for the CYP3A5*1 allele.
Further, the intestinal midazolam product ratio for persons with at least one CYP3A5*1
allele (9.6 ± 2.5 for the AG genotype) differed from that for persons with two
CYP3A5*3 alleles (6.0 ± 0.3; P=0.06).

Although the CYP3A4*1B allele was originally proposed to be associated with altered
CYP3A4 hepatic activity, that proposal has been controversial. Since the
CYP3A4*1B and CYP3AP1*1 alleles, or the CYP3A4*1B and CYP3A5*1 alleles, can all be
present in the same person, we hypothesized that it is ultimately the CYP3A5 genotype
that influences the overall functional activity of CYP3A. African Americans
frequently carried both the CYP3AP1*1 and CYP3A4*1B alleles (TABLE 5), and
using data from those persons homozygotic for CYP3A4*1B or CYP3AP1*1, we
determined the linkage of the two alleles (data not shown; chi-squared = 12.8,
P<0.001). The association of the CYP3A5*1 and CYP3A4*1B alleles was not
significant in African Americans, but approached statistical significance in Caucasians.
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TABLE 5 

Presence of CYP3A4*1B and CYP3AP1*1 or CYP3A4*1B and CYP3A5*1
in One Person

Columns: CYP3AP1*3/*3; at least one CYP3AP1*1 allele; CYP3A5*3/*3;
at least one CYP3A5*1 allele.

Rows:
Caucasians
    CYP3A4*1/*1
    CYP3A4*1/*1B or CYP3A4*1B/*1B
African Americans
    CYP3A4*1/*1
    CYP3A4*1/*1B or CYP3A4*1B/*1B

CYP3AP1 and CYP3A4 genotypes:
Caucasians: chi-squared = 6.08, P=0.01; African Americans: chi-squared = 12.26,
P=0.0004.
CYP3A5 and CYP3A4 genotypes:
Caucasians: [illegible]; African Americans: Fisher's exact test, P=0.28.


The CYP3A7 promoter contains a set of seven tightly linked variants that replaced 60
bp of the CYP3A7 promoter with the identical region from the CYP3A4 promoter; thus
this replacement altered three transcription factor binding sites: HNF-5, an octamer
motif, and the PXRE. Although CYP3A7 is generally expressed only during fetal
development, CYP3A7 mRNA is detected in a small fraction of adults. We
speculated that these nucleotides are important for the increased expression of hepatic
CYP3A4 and loss of CYP3A7 expression in most persons after birth. We evaluated
hepatic CYP3A7 mRNA expression in nine adult livers (five with the variant
CYP3A7*1C allele) to determine whether this replacement of CYP3A7 promoter
elements by CYP3A4 sequences is related to expression of CYP3A7. The two persons
with the highest levels of CYP3A7 mRNA carried CYP3A7*1C, and four of the five
persons with CYP3A7*1C alleles expressed CYP3A7 mRNA (data not shown).
However, one person with the CYP3A7*1C allele had no CYP3A7 mRNA, and two
persons lacking CYP3A7*1C had very low but detectable levels of CYP3A7 mRNA.
Thus, CYP3A7*1C is associated with CYP3A7 mRNA expression but is not the sole
explanation of CYP3A7 expression in adults.



DISCUSSION



Hepatic CYP3A5 was first shown to be polymorphically expressed in [year illegible],
but the molecular basis for this polymorphism has remained undefined. Our findings show
that SNPs in the CYP3A5 gene that cause alternative splicing and truncation of
CYP3A5 protein are the molecular basis for the absence of CYP3A5 protein in some
persons. The most common cause of the loss of hepatic CYP3A5 expression is a SNP
at nt 22893 (AC005020) in intron 3 of CYP3A5*3; this SNP generates a cryptic splice
site and exon 3B. Splicing of this exon into the CYP3A5*3 transcript results in a shift
in the open reading frame and introduction of a stop codon. Thus, translation of the
abnormal transcript generates a protein that is prematurely terminated after amino acid
102. The new CYP3A5*6 allele, which was identified in African Americans, contained
a G30597A mutation in exon 7; this SNP correlates with the deletion of exon 7 from the
CYP3A5 mRNA and lower CYP3A5 catalytic activity in three African Americans.
CYP3A5 expression was also closely associated in Caucasians with the CYP3AP1*1
genotype. However, the CYP3A5 phenotype was better predicted by CYP3A5
genotype in African Americans, and comparison of CYP3AP1 and CYP3A5 genotypes
among the DPDR subset revealed only 37.5% linkage; in Mexicans, only 65% linkage;
and 90-95% linkage in Japanese, Southeast Asians, Chinese and Pacific Islanders.
Although we detected small amounts of normally spliced CYP3A5 mRNA and
resulting protein even in those persons homozygous for CYP3A5*3, cryptic splice
sites within intron 3 were preferentially used. Thus, only those persons with at least
one CYP3A5*1 allele polymorphically express high amounts of CYP3A5. These
results add to the growing list of splicing alterations that affect the expression of
clinically important genes.
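
The genotype-to-phenotype relationship described above amounts to a simple rule: a person carrying at least one CYP3A5*1 allele is predicted to express high amounts of CYP3A5, and a person carrying only alleles such as CYP3A5*3 or CYP3A5*6 is not. The sketch below encodes that rule as an illustration. The function name, the genotype string format and the "high"/"low" labels are hypothetical, and the binary rule is a simplification: livers with the CYP3A5*1/*6 genotype had relatively low CYP3A5 levels, and one CYP3A5*1/*3 liver had only a trace amount of protein.

```python
# Alleles that encode normally spliced, full-length CYP3A5 mRNA (per the text).
FUNCTIONAL_ALLELES = {"*1"}


def predict_cyp3a5_expression(genotype: str) -> str:
    """Predict CYP3A5 expression ("high" or "low") from a diplotype string
    such as "*1/*3". Illustrative rule only; see the caveats in the lead-in."""
    alleles = [allele.strip() for allele in genotype.split("/")]
    if len(alleles) != 2 or not all(a.startswith("*") for a in alleles):
        raise ValueError(f"expected a diplotype such as '*1/*3', got {genotype!r}")
    has_functional = any(a in FUNCTIONAL_ALLELES for a in alleles)
    return "high" if has_functional else "low"


if __name__ == "__main__":
    for g in ("*1/*1", "*1/*3", "*3/*3", "*3/*6"):
        print(g, "->", predict_cyp3a5_expression(g))
```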

In addition to determining the basis for polymorphic CYP3A5 expression, we
demonstrated that in persons with at least one CYP3A5*1 allele, the peak hepatic
content of CYP3A5 is much higher than previously thought and is approximately
the same as the reported peak level of CYP3A4 (ref 6). For the majority of
Caucasians and African Americans who carry the CYP3A5*1 allele, we also found
that CYP3A5 accounts for at least 50% of the total CYP3A content and is thus likely to
contribute significantly to the total metabolic clearance of the many CYP3A
substrates. Indeed, we predict that persons with the highest clearance and lowest oral
bioavailability of CYP3A substrates will be heterozygous or homozygous for
CYP3A5*1. These persons might be more likely to encounter a lack of efficacy from
a standard dose of active parent drug. Moreover, because CYP3A4 and CYP3A5 have
varying degrees of catalytic capability and regioselectivity toward some substrates and
because CYP3A4 and CYP3A5 are differentially inhibited by some compounds,
polymorphic CYP3A5 expression will contribute to differences in metabolite profiles
and susceptibility to inhibitory drug interactions. One can also envision an increased
risk of adverse effects from CYP3A5-generated toxic metabolites in persons that
predominantly express CYP3A5. For instance, the rate of 3α-hydroxylation of
aflatoxin B1 (the inactivation step) by CYP3A5 is approximately 1% of that of
CYP3A4, whereas CYP3A5 efficiently catalyzes the 8,9-epoxidation (the activation
step) of aflatoxin B1. Thus, in those for whom CYP3A5 is the dominant hepatic
CYP3A, the relative lack of aflatoxin B1 detoxification compared to activation of this
toxin might enhance aflatoxin's genotoxicity. Because polymorphic CYP3A5 is one
factor contributing to individual variation in CYP3A-mediated metabolism of drugs,
simple DNA-based tests can now be used to determine how individual differences
in CYP3A5 contribute to overall metabolic fate of these CYP3A substrates, to their
pharmacodynamic variability, and to disease risk.

CYP3A5 is the primary CYP3A family member expressed outside the liver and
intestine (e.g., kidney, lung and polymorphonuclear leukocytes); this feature suggests
that CYP3A5 has an important physiologic function in these tissues. Indeed,
CYP3A5 can mediate the metabolism of cortisol to 6β-hydroxycortisol, a physiologic
regulator of Na+ transport in renal epithelia. Variable and polymorphic renal
expression of CYP3A5 (refs. 35, 36) could contribute to individual differences in the
localized generation of 6β-hydroxycortisol within the nephron and could play an
etiologic role, for example, in salt-sensitive hypertension by increasing renal retention
of Na+. Thus, CYP3A5 is not just a catalyst of drug detoxification, but in organs such
as the kidney, CYP3A5 may serve an important function in regulating the pool of
endogenous paracrine or endocrine factors. Similarly, polymorphic expression of
CYP3A5 could contribute to variable metabolism of steroids in the prostate and breast
and to differences in the concentrations of circulating steroids, and hence, risk of
disease in these tissues.

Among the Caucasians and African Americans in this study, the CYP3A5*1 genotype
was associated with significant levels of CYP3A5 protein in the liver and small
intestine, the two main tissues in which CYP3A5 is likely to make a
contribution to drug elimination. The polymorphic distribution of the CYP3A5*1 allele
suggests that relatively high levels of metabolically active CYP3A5 are expressed by
an estimated 30% of Caucasians, 30% of Japanese, 30% of Mexicans, 40% of Chinese
and more than 50% of African Americans, Southeast Asians, Pacific Islanders, and
Southwestern American Indians. The higher prevalence of CYP3A5 expression
suggests that these non-Caucasians are more likely to experience higher clearance of
drugs principally inactivated by CYP3A; are less likely to experience dose-limiting
toxicities; and have different risks of diseases that are associated with the CYP3A5
expressor phenotype. Natural selection could drive the high frequency of CYP3A5*1
alleles in African Americans and equatorial ethnic groups if, for example, there is a
causal link between CYP3A5 expression and renal metabolism of endogenous
molecules that affect Na+ retention. Such a link might confer a selective advantage in
areas of water shortage.

Using the CYP3A4*1B allele as a marker, several investigators have postulated that
CYP3A4 is a candidate gene in the development of several disorders including prostate
cancer and epipodophyllotoxin-induced secondary acute myelogenous leukemia.
However, our finding of the simultaneous occurrence of CYP3A4*1B, CYP3AP1*1
and CYP3A5*1 alleles in some persons suggests the need to reevaluate this
association and to determine whether the CYP3A5*1 allele and CYP3A5 expression are
more closely associated with these disease risks.

Data on the frequencies of SNPs in human genes have been largely gathered from
surveys of exons and introns, with little specific attention given to the frequency of
SNPs in upstream regulatory regions, particularly those of differentially regulated gene
family members. Previous studies have reported a nucleotide diversity of about 1 in
2,000 bp of noncoding DNA (consisting mostly of 5' and 3' untranslated regions and
introns) and lower frequencies in coding sequences; the least frequent SNP, with a
nucleotide diversity of approximately 1 in 20,000 bp, was the coding SNP that causes
a nonconservative amino acid substitution. In Caucasians, the observed nucleotide
diversity in the CYP3A4 promoter was 1 in 7,246 bp; in the CYP3AP1 promoter, 1 in
2,577 bp; in the CYP3A5 promoter, 1 in 3,175 bp; and in the CYP3A7 promoter, 1
in 4,444 bp. In African Americans the observed nucleotide diversity in the CYP3A4
promoter was 1 in 1,000 bp; in the CYP3AP1 promoter, 1 in 649 bp; in the CYP3A5
promoter, 1 in 2,841 bp; and in the CYP3A7 promoter, 1 in 1,677 bp. It will be of
interest to compare the level of nucleotide diversity within the CYP3A promoters with
a much larger diverse survey of human gene promoters.
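
The "1 in N bp" figures above are easier to compare when converted to a per-base-pair diversity and to a fold difference between populations. The short sketch below performs only that arithmetic on the values quoted in the preceding paragraph; it does not reproduce how the diversities themselves were estimated, which the text does not describe. The dictionary and function names are hypothetical.

```python
# Base pairs per variant ("1 in N bp"), as quoted in the text.
BP_PER_VARIANT = {
    "Caucasians": {"CYP3A4": 7246, "CYP3AP1": 2577, "CYP3A5": 3175, "CYP3A7": 4444},
    "African Americans": {"CYP3A4": 1000, "CYP3AP1": 649, "CYP3A5": 2841, "CYP3A7": 1677},
}


def per_bp(bp_per_variant: float) -> float:
    """Convert "1 in N bp" into a per-base-pair nucleotide diversity."""
    if bp_per_variant <= 0:
        raise ValueError("N must be positive")
    return 1.0 / bp_per_variant


def fold_difference(promoter: str) -> float:
    """Diversity in African Americans divided by diversity in Caucasians
    for the named promoter."""
    return (BP_PER_VARIANT["Caucasians"][promoter]
            / BP_PER_VARIANT["African Americans"][promoter])


if __name__ == "__main__":
    for promoter in BP_PER_VARIANT["Caucasians"]:
        aa = per_bp(BP_PER_VARIANT["African Americans"][promoter])
        ca = per_bp(BP_PER_VARIANT["Caucasians"][promoter])
        print(f"{promoter}: {aa:.2e} vs {ca:.2e} per bp "
              f"({fold_difference(promoter):.1f}-fold)")
```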

Traditionally, pharmacogenetics research used the strategy of identifying outliers in
drug response, obtaining the DNA from outliers, and identifying sequence variation in
genes involved in that drug's metabolism. In the post-genomic era, reverse
pharmacogenetics approaches are underway in which common polymorphisms are first
identified in panels of DNA from anonymous persons and then screening for these
common sequence variations is performed in persons whose drug metabolism
phenotype is known. For genes of pharmacogenetic interest, or for any gene, the fewer
the number of DNA samples required to detect these common genetic variants, the
more rapidly and cost-effectively common variants can be identified. Moreover, these
common variants are proposed to significantly contribute to risks of common disease.
Our results confirm that the smallest subset of eight DPDR samples was sufficient to
identify all common polymorphisms in the CYP3A promoters and in the CYP3A5 gene,
including two polymorphisms that had a high probability of being missed in a random
population sample of 25 persons. This result indicates the robust power of this
resource for rapid SNP discovery.



METHODS 



Population samples 

We sequenced the CYP3A promoters of 159 persons. Eight samples were from the
DPDR (Coriell Cell Repositories). We also obtained samples from the following
sources: 47 livers (27 Caucasian donors, 20 African American donors) whose
midazolam 1'-hydroxylase and 4-hydroxylase activities and CYP3A5 and CYP3A4
protein were characterized (University of Washington, University of Pittsburgh, the
Medical College of Virginia, and St. Jude Children's Research Hospital); primary
human hepatocytes from eight people whose basal and drug-inducible testosterone
6β-hydroxylase activity had been characterized (University of Pittsburgh); kidneys from
12 people (Indiana University); 29 subjects (representatives of 7 families) whose
nifedipine clearance had been determined (University of Newcastle upon Tyne, UK);
[number illegible] family members, one of whom was a poor nifedipine metabolizer
(University of Michigan); 12 pediatric patients (St. Jude Children's Research Hospital);
[number illegible] subjects whose hepatic CYP3A phenotype had been determined by
using the erythromycin breath test (University of Michigan); 10 subjects whose
intestinal CYP3A4 content had been determined (University of Michigan); hepatocytes
from 10 persons in whom the drug induction of CYP3A protein had been characterized
(INSERM and Medical College of Virginia); [number illegible] pairs of small intestine
and liver (some liver samples are the same as those mentioned above) and intestines of
3 Caucasians whose midazolam 1'-hydroxylase and 4-hydroxylase activities and
CYP3A5 protein had been characterized (University of Washington). We sequenced the
CYP3AP1 promoter and CYP3A5 introns 3, 4 and 6 in samples from 10 Mexicans,
9 Japanese, 10 Chinese, 10 Southeast Asians, 6 Pacific Islanders and 5 Southwestern
American Indians (Human Variation Panels, Coriell Cell Repositories). Informed
consent was obtained from all persons and approval of this research was obtained
according to the relevant institutional guidelines.

Primers and sequencing 

We used Primer3 ([address illegible]) to design primers to amplify the promoter
regions of CYP3A4, CYP3A5, CYP3AP1 and CYP3A7; the amplicons were 400 to
600 bp in length. The sequences of the primers
were: CYP3A4 (f) 5'-TGGGATGAATTTCAAGTATTTTG-3' (SEQ ID NO:1) and (r)
5'-AGGTTTCCATGGCCAAGTCT-3' (SEQ ID NO:2); CYP3A4 primers to sequence
the distal PXRE sequences (nt -7836 to -7208) (f)
5'-CCGATCAGAATAAGGCATTG-3' (SEQ ID NO:3) and (r)
5'-GATTCACCTGGGGTCAACAC-3' (SEQ ID NO:4); CYP3AP1 primers (f)
5'-GGGGATGGATTTCAAGTATTCTG-3' (SEQ ID NO:5) and (r)
5'-GTCCATCGCCACTTGCCTTCT-3' (SEQ ID NO:6); CYP3A7 primers (f)
5'-GTCTGGCTGGGTATGAAAGG-3' (SEQ ID NO:7) and (r)
5'-GCCAAGTTTGGGATGAGAT-3' (SEQ ID NO:8); CYP3A5 (f)
5'-GAGGATGGATTTCAATTATTCTA-3' (SEQ ID NO:9)
and (r) 5'-GTCCATCGCCACTTTCCTTC-3' (SEQ ID NO:10). Forward and reverse
primers were tailed with universal sequencing primers (-40 M13 and -28 M13,
respectively). Primer pairs were used for 35 cycles to amplify genomic DNA. The
following conditions were used in each cycle: 95 °C for 15 sec, 61 °C for 30 sec, and
72 °C for 5 min. We removed unincorporated nucleotides and primers by incubating
the PCR product with shrimp alkaline phosphatase and exonuclease for 30 min at 37
°C followed by 15 min at 80 °C. Primers for amplification of the full-length
CYP3A5 cDNA were (f) P1, 5'-AACAGCCCAGCAAACAGCAGC-3' (SEQ ID
NO:11) and (r) P2, 5'-TAAGCCCATCTTTATTTCAAGGT-3' (SEQ ID NO:12).
Primers for sequencing the CYP3A5 cDNA were
P3, 5'-GTTGCTATTAGACTTGAGAGGACT-3' (SEQ ID NO:13);
P4, 5'-TGTAAGGATCTATGCTGTCCTTC-3' (SEQ ID NO:14);
P5, 5'-CACAAATCGAAGGTCTTTAGGC-3' (SEQ ID NO:15);
P6, 5'-TCAAAAACTGGGGTAAGGAATG-3' (SEQ ID NO:16);
P7, 5'-GCCTAAAGACCTTCGATTTGTG-3' (SEQ ID NO:17);
P8, 5'-CATTCCTTACCCCAGTTTTTGA-3' (SEQ ID NO:18);
P9, 5'-AGTCCTCTCAAGTCTAATAGCAAC-3' (SEQ ID NO:19);
P10, 5'-GAAGGACAGCATAGATCCTTACA-3' (SEQ ID NO:20);
P11, 5'-CAGGGTCTCTGGAAATTTGACA-3' (SEQ ID NO:21);
P12, 5'-TCATTCTCCACTTAGGGTTCCA-3' (SEQ ID NO:22), and
P13, 5'-CAGCATGGATGTGATTACTGGC-3' (SEQ ID NO:23). The primers used
to amplify CYP3A5 exon 3B, 4B and 5B insertions from genomic DNA were
5020_22719 (f) 5'-CCTGCCTTCAATTTTTCACTG-3' (SEQ ID NO:24) and
5020_24161 (r) 5'-GCAATGTAGGAAGGAGGGCT-3' (SEQ ID NO:25). The nested
primers used to sequence the nt 22,893 site were 5020_22743 (f)
5'-TAATATTCTTTTTGATAATG-3' (SEQ ID NO:26) and 5020_23205 (r)
5'-CATTCTTTCACTAGCACTGTTC-3' (SEQ ID NO:27). The nested sequencing primers
used to sequence the nt 24,035 site were [primer name illegible] (f)
5'-CAACAAAAACCGGCAAACTG-3' (SEQ ID NO:28) and 5020_24135 (r)
5'-AGGATTTTCAGACTTAACAC-3' (SEQ ID NO:29). The primers used to amplify
the exon 7 deletion in CYP3A5*6 were 5020_28814 (f)
5'-GGTCATTGCTGTCTCCAACC-3' (SEQ ID NO:30) and (r) the P6 primer (SEQ ID
NO:16), and to sequence across exon 7 5020_30237 (f)
5'-TATGACTGGGCTCCTTGACC-3' (SEQ ID NO:31) and 5020_30745 (r)
5'-TGGAATTGTACCTTTTAAGTGGA-3' (SEQ ID NO:32).
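
Several of the cDNA sequencing primers listed above appear to be exact reverse
complements of one another (P3/P9, P4/P10, P5/P7 and P6/P8), which would be
consistent with reading the same sites on both strands. The following Python
sketch checks this. The sequences are copied from the list above; the function
and variable names are illustrative only and are not part of the specification.

```python
# Translation table for the four unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# cDNA sequencing primers P3-P10 (SEQ ID NO:13-20), written 5'->3'.
PRIMERS = {
    "P3": "GTTGCTATTAGACTTGAGAGGACT",
    "P4": "TGTAAGGATCTATGCTGTCCTTC",
    "P5": "CACAAATCGAAGGTCTTTAGGC",
    "P6": "TCAAAAACTGGGGTAAGGAATG",
    "P7": "GCCTAAAGACCTTCGATTTGTG",
    "P8": "CATTCCTTACCCCAGTTTTTGA",
    "P9": "AGTCCTCTCAAGTCTAATAGCAAC",
    "P10": "GAAGGACAGCATAGATCCTTACA",
}

# Pairs that look like opposite-strand versions of the same site.
PAIRS = [("P3", "P9"), ("P4", "P10"), ("P5", "P7"), ("P6", "P8")]


def paired_primers_match(pairs=PAIRS, primers=PRIMERS):
    """Map each 'A/B' pair to True if A is the reverse complement of B."""
    return {
        f"{a}/{b}": reverse_complement(primers[a]) == primers[b]
        for a, b in pairs
    }


if __name__ == "__main__":
    for pair, ok in paired_primers_match().items():
        print(pair, "reverse complements" if ok else "NOT reverse complements")
```

A check of this kind is also a convenient way to catch transcription errors in
a primer list, since a single wrong base breaks the match.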

We sequenced the proximal promoters by performing standard sequencing with
Amersham ET Dye Primers. The distal promoter region of CYP3A4 and the CYP3A5
promoter were sequenced using BigDye Terminator sequencing, and products were
resolved by polyacrylamide gel electrophoresis or by capillary electrophoresis.
The resultant trace files were base-called by phred and assembled by phrap
(http://www.genome.washington.edu). PolyPhred was used to detect potential
heterozygosity. To be a true variant, the variant-containing sequence generated
by the forward primer had to be identical to that generated by the reverse primer.
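
The forward/reverse confirmation rule stated above can be sketched as a simple
filter. This is a minimal illustration under assumed inputs, not PolyPhred's
algorithm: the dictionaries mapping nucleotide position to genotype call, the
function name and the example calls are all hypothetical.

```python
def confirmed_variants(forward_calls, reverse_calls):
    """Keep only variant calls that are identical in both read directions.

    Each argument maps a nucleotide position to the call made from reads
    primed in that direction, e.g. {22893: "A/G"}. A position that is
    missing from, or called differently in, the other direction is dropped.
    """
    return {
        pos: call
        for pos, call in forward_calls.items()
        if reverse_calls.get(pos) == call
    }


if __name__ == "__main__":
    # Hypothetical calls for one sample; not data from this study.
    forward = {22893: "A/G", 24035: "C/T"}
    reverse = {22893: "A/G", 24035: "C/C"}
    print(confirmed_variants(forward, reverse))
```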

Western blot analysis 

We performed quantitative immunoblotting of CYP3A5 and CYP3A4 content of tissue
preparations as described. Purified cDNA-expressed CYP3A5 (a gift from R. Peter)
and CYP3A4 purified from human liver were the reference standards. Liver
microsomes (10-20 µg) and intestinal homogenates (50 µg), both of which had been
prepared from organ donor tissue, were resolved by electrophoresis, and CYP3A5
was detected with anti-CYP3A5 antibody (Gentest) or anti-CYP3A4 antibody. We
determined the integrated optical density of each band by using a Bio-Rad ChemiDoc
and Quantity One program.

Midazolam kinetic protocol

Midazolam, 15N3-midazolam, 1'-OH MDZ and 4-OH MDZ were provided by Roche
Laboratories. All incubations were performed in duplicate in solutions containing
0.1 M potassium phosphate (pH 7.4) and 20 to 100 µg human liver microsomes or 0 to
100 µg intestinal homogenates or 10 pmol of recombinant CYP3A4 or CYP3A5
(Gentest). Midazolam (final concentration, 8 µM) was added to the diluted tissue
preparations, and the mixtures were preincubated at 37 °C for 5 min. We added
NADPH (final concentration, 1 mM) to initiate the reaction. Incubation of human
liver microsomes lasted for 2 min; incubation of recombinant CYP3A and intestinal
samples lasted for 4 min. Reactions were terminated by the addition of 1 mL of
ice-cold 0.1 M Na2CO3, pH ~11. We used NCI GC-MS to measure the quantities of
extracted 1'-OH MDZ and 4-OH MDZ (ref. 14). Reaction velocities and product ratios
are presented as mean ± SD. Statistical comparisons of mean data for different
genotypes were performed by using a 2-sided, unpaired t test with unequal variances.
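
An unpaired t test with unequal variances is commonly computed as Welch's t
test. The sketch below shows the test statistic and the Welch-Satterthwaite
degrees of freedom using only the Python standard library. It illustrates the
general technique and is not the software used in this work; the function name
and the example numbers are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t, df) for an unpaired t test with unequal variances.

    t  = (mean_a - mean_b) / sqrt(s_a^2/n_a + s_b^2/n_b)
    df = Welch-Satterthwaite approximation to the degrees of freedom.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error contributed by each group (sample variance / n).
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t, df


if __name__ == "__main__":
    # Hypothetical values for two genotype groups; not data from this study.
    group_1 = [1.0, 2.0, 3.0, 4.0, 5.0]
    group_2 = [2.0, 4.0, 6.0, 8.0, 10.0]
    t, df = welch_t(group_1, group_2)
    print(f"t = {t:.4f}, df = {df:.3f}")
```

The two-sided P value is then read from Student's t distribution with df
degrees of freedom. Where SciPy is available,
scipy.stats.ttest_ind(a, b, equal_var=False) performs the same test and
returns the P value directly.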




Reverse Transcription-PCR of CYP3A7

Total RNA (5-10 µg) from human liver was reverse-transcribed according to the
manufacturer's instructions (Life Technologies). CYP3A7 cDNA was amplified from
first-strand cDNA by using oligonucleotides CYP3A7(S)
5'-ATTCCAAGCTATGTTCTTCATCAT-3' (SEQ ID NO:35), and CYP3A7(AS)
5'-AATCTACTTCCCCAGCACTGA-3' (SEQ ID NO:36), under described conditions
except that the initial denaturation lasted 5 min, the annealing temperature was
58 °C, and the reaction required 25 cycles. The PCR product was analyzed on
agarose gels. Amplification of 28S rRNA served as a control for RNA integrity.




GenBank accession numbers. [gene name illegible] promoter, S74700;
CYP3A[illegible] promoter consensus sequence, AF35929.

EXAMPLE 3

PCR-RFLP BASED METHOD FOR IDENTIFYING CYP3A5*1,
CYP3A5*3 AND CYP3A5*6 ALLELES

We developed a mismatched PCR-RFLP based method to characterize the CYP3A5*1,
CYP3A5*3 and CYP3A5*6 polymorphisms that is suitable for large-scale screening
and clinical testing applications. To distinguish the CYP3A5*1 and CYP3A5*3
alleles, nested PCR is performed using a mismatched forward primer 5020_22871 (f)
(5'-TAAAGAGCTCTTTTGTCTTTTA-3') (SEQ ID NO:33) and the reverse primer
5020_23205 (r) 5'-CATTCTTTCACTAGCACTGTTC-3' (SEQ ID NO:27). The
mismatched forward primer introduces a mismatch "T" (denoted underlined) at
nucleotide 22891 in all CYP3A5 alleles. This creates a unique Tru9I and/or MseI
restriction site in the CYP3A5*1 expressor-22893A allele (TTAA), but not in the
non-expressor-G allele (TTAG). Susceptibility of the PCR product to cleavage by
Tru9I and/or MseI indicates the presence of the CYP3A5*1 allele. The sequences of
the primers and the PCR-RFLP method are diagramed in FIGURES 3 and 4. Homozygous
(*1/*1) versus heterozygous (*1/*3) individuals can be distinguished by the
presence of uncleaved full-length PCR product (334 bp) in heterozygotes, in
addition to the smaller cleaved DNA fragments (314 bp and 20 bp).
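
The way the mismatched primer makes cleavage depend on the base at nt 22893
can be illustrated with a toy in silico digest. The sketch below is a model
only: it uses the recognition sequence TTAA with the standard Tru9I/MseI cut
position (T^TAA) and the product sizes given above, but the "C" filler
flanking the site is a placeholder rather than CYP3A5 sequence, and the
function names are hypothetical.

```python
MSEI_SITE = "TTAA"  # Tru9I/MseI recognition sequence; cut is T^TAA


def digest_fragments(amplicon):
    """Return fragment lengths (5'->3') after complete Tru9I/MseI digestion."""
    fragments, start = [], 0
    pos = amplicon.find(MSEI_SITE)
    while pos != -1:
        cut = pos + 1  # the enzyme cuts after the first T of TTAA
        fragments.append(cut - start)
        start = cut
        pos = amplicon.find(MSEI_SITE, pos + 1)
    fragments.append(len(amplicon) - start)
    return fragments


def model_amplicon(base_22893, total_length=334):
    """Build a placeholder 334-bp product for the CYP3A5*1/*3 assay.

    The 22-nt forward primer is modeled as 19 filler bases followed by
    'TTA' (the middle T is the introduced mismatch at nt 22891); the next
    base is the polymorphic nt 22893 supplied by the template.
    """
    primer = "C" * 19 + "TTA"
    downstream = "C" * (total_length - len(primer) - 1)
    return primer + base_22893 + downstream


if __name__ == "__main__":
    for base in ("A", "G"):
        print(f"22893{base}:", digest_fragments(model_amplicon(base)))
```

With the 22893A template the model product contains TTAA and is cut 20 bp from
the primer end; with 22893G the site reads TTAG and the product stays intact.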

To distinguish the CYP3A5*1 and CYP3A5*6 alleles, nested PCR is performed using a
mismatched forward primer 5020_30569 (f)
(5'-CACAAGACCCCTTTGTGGAGAGCACTTA-3') (SEQ ID NO:34) and the reverse
primer 5020_30745 (r) 5'-TGGAATTGTACCTTTTAAGTGGA-3' (SEQ ID NO:32).
The mismatched forward primer introduces a mismatch "T" (denoted underlined) at
nucleotide 30595 in all CYP3A5 alleles. This creates a unique Tru9I and/or MseI
restriction site in the CYP3A5*6 non-expressor-30597A allele (TTAA), but not in
the CYP3A5*1 expressor-G allele (TTAG). Susceptibility of the PCR product to
cleavage by Tru9I and/or MseI indicates the presence of the CYP3A5*6 allele. The
sequences of the primers and the PCR-RFLP method are diagramed in FIGURES 5 and 6.
Homozygous (*6/*6) versus heterozygous (*1/*6) individuals can be distinguished by
the presence of uncleaved full-length PCR product (177 bp) in heterozygotes, in
addition to the smaller cleaved DNA fragments (151 bp and 26 bp).
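
The band patterns described for the two assays can be summarized as a lookup
from observed fragment sizes to genotype. The following sketch is illustrative
only. It assumes exact band sizes as input and calls cleavage from the larger
cleaved fragment alone (an assumption made here because a 20- or 26-bp
fragment may be hard to resolve on a gel); the assay keys and function name
are hypothetical.

```python
# Expected product sizes (bp) for each mismatched PCR-RFLP assay, and the
# allele indicated by the cleaved and by the uncleaved product.
ASSAYS = {
    "CYP3A5*1/*3": {
        "full_length": 334, "cleaved_large": 314,
        "cleaved_allele": "*1", "uncleaved_allele": "*3",
    },
    "CYP3A5*1/*6": {
        "full_length": 177, "cleaved_large": 151,
        "cleaved_allele": "*6", "uncleaved_allele": "*1",
    },
}


def call_genotype(assay, bands):
    """Return a genotype string such as '*1/*3' from observed band sizes."""
    spec = ASSAYS[assay]
    observed = set(bands)
    cleaved = spec["cleaved_large"] in observed
    uncleaved = spec["full_length"] in observed
    if not (cleaved or uncleaved):
        raise ValueError("no expected band observed; repeat the assay")
    alleles = []
    if cleaved:
        alleles.append(spec["cleaved_allele"])
    if uncleaved:
        alleles.append(spec["uncleaved_allele"])
    if len(alleles) == 1:
        alleles *= 2  # a single band type means the sample is homozygous
    return "/".join(sorted(alleles))


if __name__ == "__main__":
    print(call_genotype("CYP3A5*1/*3", [334, 314, 20]))
    print(call_genotype("CYP3A5*1/*6", [151, 26]))
```

Note that an uncleaved product in the second assay only shows that the allele
carries G at nt 30597; the label *1 follows the wording of the text above.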



Alternatively, either of the above amplifications can be done as a two-round PCR,
with first-round amplification for CYP3A5*1/CYP3A5*3 analysis using primers
5020_22719 (f) 5'-CCTGCCTTCAATTTTTCACTG-3' (SEQ ID NO:24) and
5020_24161 (r) 5'-GCAATGTAGGAAGGAGGGCT-3' (SEQ ID NO:25), and for
CYP3A5*1/CYP3A5*6 analysis, primers 5020_28814 (f)
5'-GGTCATTGCTGTCTCCAACC-3' (SEQ ID NO:30) and (r) the P6 primer (SEQ ID
NO:16). The second-round amplification is then performed as noted above, followed
directly by Tru9I/MseI restriction enzyme digestion.

Various references are cited throughout this Specification, each of which is
incorporated herein by reference in its entirety.

Although the foregoing invention has been described in some detail by way of
illustration and example for purposes of clarity of understanding, it will be obvious
that certain changes and modifications may be practiced within the scope of the
appended claims.



